Sequences of hepatitis C virus genotypes and their use as therapeutic and diagnostic agents

ABSTRACT

The present invention relates to a polynucleic acid composition comprising or consisting of at least one polynucleic acid containing 8 or more contiguous nucleotides corresponding to a nucleotide sequence from the region spanning positions  417  to  957  of the Core/E 1  region of HCV type  3;  and/or the region spanning positions  4664  to  4730  of the NS 3  region of HCV type  3;  and/or the region spanning positions  4892  to  5292  of the NS 3/4  region of HCV type  3;  and/or the region spanning positions  8 023  to  8 235  of the NS 5  region of the BR 36  subgroup of HCV type  3   a  and/or the coding region of HCV type  4   a  starting at nucleotide  379  in the core region; and/or the coding region of HCV type  4;  and/or the coding region of HCV type  5,  with said nucleotide numbering being with respect to the numbering of HCV nucleic acids as shown in Table 1, and with said polynucleic acids containing at least one nucleotide difference with known HCV type  1,  and/or HCV type  2  genomes in the above-indicated regions, or the complement thereof.

[0001] The invention relates to new sequences of hepatitis C virus (HCV)genotypes and their use as therapeutic and diagnostic agents.

[0002] The present invention relates to new nucleotide and amino acidsequences corresponding to the coding region of a new type 2 subtype 2d, type-specific sequences corresponding to HCV type 3 a, to newsequences corresponding to the coding region of a new subtype 3 c, andto new sequences corresponding to the coding region of HCV type 4 andtype 5 subtype 5 a; a process for preparing them, and their use fordiagnosis, prophylaxis and therapy.

[0003] The technical problem underlying the present invention is toprovide new type-specific sequences of the Core, the E1, the E2, theNS3, the NS4 and the NS5 regions of HCV type 4 and type 5, as well as ofnew variants of HCV types 2 and 3. These new HCV sequences are useful todiagnose the presence of type 2 and/or type 3 and/or type 4 and/or type5 HCV genotypes in a biological sample. Moreover, the availability ofthese new type-specific sequences can increase the overall sensitivityof HCV detection and should also prove to be useful for therapeuticpurposes.

[0004] Hepatitis C viruses (HCV) have been found to be the major causeof non-A, non-B hepatitis. The sequences of cDNA clones covering thecomplete genome of several prototype isolates have been determined (Katoet al., 1990; Choo et al., 1991; Okamoto et al., 1991; Okamoto et al.,1992). Comparison of these isolates shows that the variability innucleotide sequences can be used to distinguish at least 2 differentgenotypes, type 1 (HCV-1 and HCV-J) and type 2 (HC-J6 and HC-J8), withan average homology of about 68%. Within each type, at least twosubtypes exist (e.g. represented by HCV-1 and HCV-J), having an averagehomology of about 79%. HCV genomes belonging to the same subtype showaverage homologies of more than 90% (Okamoto et al., 1992). However, thepartial nucleotide sequence of the NS5 region of the HCV-T isolatesshowed at most 67% homology with the previously published sequences,indicating the existence of a yet another HCV type (Mori et al., 1992).Parts of the 5′ untranslated region (UR), core, NS3, and NS5 regions ofthis type 3 have been published, further establishing the similarevolutionary distances between the 3 major genotypes and their subtypes(Chan et al., 1992).

[0005] The identification of type 3 genotypes in clinical samples can beachieved by means of PCR with type-specific primers for the NS5 region.However, the degree to which this will be successful is largelydependent on sequence variability and on the virus titer present in theserum. Therefore, routine PCR in the open reading frame, especially fortype 3 and the new type 4 and 5 described in the present inventionand/or group V (Cha et al., 1992) genotypes can be predicted to beunsuccessful. A new typing system (LiPA), based on variation in thehighly conserved 5′ UR, proved to be more useful because the 5 major HCVgenotypes and their subtypes can be determined (Stuyver et al., 1993).The selection of high-titer isolates enables to obtain PCR fragments forcloning with only 2 primers, while nested PCR requires that 4 primersmatch the unknown sequences of the new type 3, 4 and 5 genotypes.

[0006] New sequences of the 5′ untranslated region (5′UR) have beenlisted by Bukh et al. (1992). For some of these, the E1 region hasrecently been described (Bukh et al., 1993). Isolates with similarsequences in the 5′UR to a group of isolates including DK12 and HK10described by Bukh et al. (1992) and E-b1 to E-b8 described andclassified as type 3 by Chan et al. (1991), have been reported anddescribed in the 5′UR, the carboxyterminal part of E1, and in the NS5region as group IV by Cha et al. (1992; WO 92/19743), and have also beendescribed in the 5′UR for isolate BR56 and classified as type 3 by theinventors of this application (Stuyver et al., 1993).

[0007] The aim of the present invention is to provide new HCV nucleotideand amino acid sequences enabling the detection of HCV infection.

[0008] Another aim of the present infection is to provide new nucleotideand amino acid HCV sequences enabling the classification of infectedbiological fluids into different serological groups unambiguously linkedto types and subtypes at the genome level.

[0009] Another aim of the present invention is to provide new nucleotideand amino acid HCV sequences ameliorating the overall HCV detectionrate.

[0010] Another aim of the present invention is to provide new HCVsequences, useful for the design of HCV vaccine compositions.

[0011] Another aim of the present invention is to provide apharmaceutical composition consisting of antibodies raised against thepolypeptides encoded by these new HCV sequences, for therapy ordiagnosis.

[0012] The present invention relates more particularly to a compositioncomprising or consisting of at least one polynucleic acid containing atleast 5, and preferably 8 or more contiguous nucleotides selected fromat least one of the following HCV sequences:

[0013] an HCV type 3 genomic sequence, more particularly in any of thefollowing regions:

[0014] the region spanning positions 417 to 957 of the Core/E1 region ofHCV subtype 3 a,

[0015] the region spanning positions 4664 to 4730 of the NS3 region ofHCV type 3,

[0016] the region spanning positions 4892 to 5292 of the NS3/4 region ofHCV type 3,

[0017] the region spanning positions 8023 to 8235 of the NS5 region ofthe BR36 subgroup of HCV subtype 3 a,

[0018] an HCV subtype 3 c genomic sequence,

[0019] more particularly the coding regions of the above-specifiedregions;

[0020] an HCV subtype 2 d genomic sequence, more particularly the codingregion of HCV subtype 2 d,

[0021] an HCV type 4 genomic sequence, more particularly the codingregion, more particularly the coding region of subtypes 4 a, 4 e, 4 f, 4g, 4 h, 4 i, and 4j,

[0022] an HCV type 5 genomic sequence, more particularly the codingregion of HCV type 5, more particularly the regions encoding Core, E1,E2, NS3, and NS4

[0023] with said nucleotide numbering being with respect to thenumbering of HCV nucleic acids as shown in Table 1, and with saidpolynucleic acids containing at least one nucleotide difference withknown HCV (type 1, type 2, and type 3) polynucleic acid sequences in theabove-indicated regions, or the complement thereof.

[0024] It is to be noted that the nucleotide difference in thepolynucleic acids of the invention may involve or not an amino aciddifference in the corresponding amino acid sequences coded by saidpolynucleic acids.

[0025] According to a preferred embodiment, the present inventionrelates to a composition comprising or containing at least onepolynucleic acid encoding an HCV polyprotein, with said polynucleic acidcontaining at least 5, preferably at least 8 nucleotides correspondingto at least part of an HCV nucleotide sequence encoding an HCVpolyprotein, and with said HCV polyprotein containing in its sequence atleast one of the following amino acid residues: L7, Q43, M44, S60, R67,Q70, T71, A79, A87, N106, K115, A127, A190, S130, V134, G142, 1144,E152, A157, V158, P165, S177 or Y177, I178, V180 or E180 or F182, R184,I186, H187, T189, A190, S191 or G191, Q192 or L192 or I192 or V192 orE192, N193 or H193 or P193, W194 or Y194, H195, A197 or I197 or V197 orT197, V202, I203 or L203, Q208, A210, V212, F214, T216, R217 or D217 orE217 or V217, H218 or N218, H219 or V219 or L219, L227 or I227, M231 orE231 or Q231, T232 or D232 or A232 or K232, Q235 or I235, A237 or T237,I242, I246, S247, S248, V249, S250 or Y250, I251 or V251 or M251 orF251, D252, T254 or V254, L255 or V255, E256 or A256, M258 or F258 orV258, A260 or Q260 or S260, A261, T264 or Y264, M265, I266 or A266,A267, G268 or T268, F271 or M271 or V271, I277, M280 or H280, I284 orA284 or L84, V274, V291, N292 or S292, R293 or I293 or Y293, Q294 orR294, L297 or I297 or Q297, A299 or K299 or Q299, N303 or T303, T308 orL308, T310 or F310 or A310 or D310 or V310, L313, G317 or Q317, L333,S351, A358, A359, A363, S364, A366, T369, L373, F376, Q386, I387, S392,I399, F402, I403, R405, D454, A461, A463, T464, K484, Q500, E501, S521,K522, H524, N528, S531, S532, V534, F536, F537, M539, I546, C1282,A1283, H1310, V1312, Q1321, P1368, V1372, V1373, K1405, Q1406, S1409,A1424, A1429, C1435, S1436, S1456, H1496, A1504, D1510, D1529, I1543,N1567, D1556, N1567, M1572. Q1579, L1581, S1583, F1585, V1595, E1606 orT1606, M1611, V1612 or L1612, P1630. C1636, P1651, T1656 or I1656,L1663, V1667, V1677, A1681, H1685, E1687, G1689, V1695, A1700, Q1704,Y1705, A1713, A1714 or S1714, M1718, D1719, A1721 or T1721, R1722, A1723or V1723, H1726 or G1726, E1730, V1732, F1735, I1736, S1737, R1738,T1739, G1740, Q1741, K1742, Q1743, A1744, T1745, L1746, E1747 or K1747,I1749, A1750, T1751 or A1751, V1753, N1755, K1756, A1757, P1758, A1759,H1762, T1763, Y1764, P2645, A2647, K2650, K2653 or L2653, S2664, N2673,F2680, K2681, L2686, H2692, Q2695 or L2695 or I2695, V2712, F2715, V2719or Q2719, T2722, T2724, S2725, R2726, G2729, Y2735, H2739, I2748, G2746or I2746, I2748, P2752 or K2752, P2754 or T2754, T2757 or P2757, withsaid notation being composed of a letter representing the amino acidresidue by its one-letter code, and a number representing the amino acidnumbering according to Kato et al., 1990.

[0026] Each of the above-mentioned residues can be found in any of FIGS.2, 5, 7, 11 or 12 showing the new amino acid sequences of the presentinvention aligned with known sequences of other types or subtypes of HCVfor the Core, E1, E2, NS3, NS4, and NS5 regions.

[0027] More particularly, a polynucleic acid contained in thecomposition according to the present invention contains at least 5,preferably 8, or more contiguous nucleotides corresponding to a sequenceof contiguous nucleotides selected from at least one of HCV sequencesencoding the following new HCV amino acid sequences:

[0028] new sequences spanning amino acid positions 1 to 319 of theCore/E1 region of HCV subtype 2 d, type 3 (more particularly newsequences for subtypes 3 a and 3 c), new type 4 subtypes (moreparticularly new sequences for subtypes 4 a, 4 e, 4 f, 4 g, 4 h, 4 i and4j) and type 5 a, as shown in FIG. 5;

[0029] new sequences spanning amino acid positions 328 to 546 of theE1/E2 region of HCV subtype 5 a as shown in FIG. 12;

[0030] new sequences spanning amino acid positions 1556 to 1764 of theNS3/NS4 region of HCV type 3 (more particularly for new subtypes 3 asequences), and subtype 5 a, as shown in FIG. 7 or 11;

[0031] new sequences spanning amino acid positions 2645 to 2757 of theNS5B region of HCV subtype 2 d, type 3 (more particularly for newsubtypes 3 a and 3 c), new type 4 subtypes (more particularly subtypes 4a, 4 e, 4 f, 4 g, 4 h, 4 i and 4 j) and subtype 5 a, as shown in FIG. 2,

[0032] Using the LiPA system mentioned above, Brazilian blood donorswith high titer type 3 hepatitis C virus, Gabonese patients withhigh-titer type 4 hepatitis C virus, and a Belgian patient withhigh-titer HCV type 5 infection were selected. Nucleotide sequences inthe core, E1, NS5 and NS4 regions which have not yet been reportedbefore, were analyzed in the frame of the invention. Coding sequences(with the exception of the core region) of any type 4 isolate arereported for the first time in the present invention. The NS5b regionwas also analyzed for the new type 3 isolates. After having determinedthe NS5b sequences, comparison with the Ta and Tb subtypes described byMori et al. (1992) was possible, and the type 3 sequences could beidentified as type 3 a genotypes. The new type 4 isolates segregatedinto 10 subtypes, based on homologies obtained in the NS5 and E1regions. New type 2 and 3 sequences could also be distinguished frompreviously described type 2 or 3 subtypes from sera collected in Belgiumand the Netherlands.

[0033] The term “polynucleic acid” refers to a single stranded or doublestranded nucleic acid sequence which may contain at least 5 contiguousnucleotides to the complete nucleotide sequence (f.i. at least 6, 7, 8,9, 10, 11, 12, 13, 14, 15 or more contiguous nucleotides). A polynucleicacid which is up till about 100 nucleotides in length is often alsoreferred to as an oligonucleotide. A polynucleic acid may consist ofdeoxyribonucleotides or ribonucleotides, nucleotide analogues ormodified nucleotides, or may have been adapted for therapeutic purposes.A polynucleic acid may also comprise a double stranded cDNA clone whichcan be used for cloning purposes, or for in vivo therapy, orprophylaxis.

[0034] The term “polynucleic acid composition” refers to any kind ofcomposition comprising essentially said polynucleic acids. Saidcomposition may be of a diagnostic or a therapeutic nature.

[0035] The expression “nucleotides corresponding to” refers tonucleotides which are homologous or complementary to an indicatednucleotide sequence or region within a specific HCV sequence.

[0036] The term “coding region” corresponds to the region of the HCVgenome that encodes thy HCV polyprotein. In fact, it comprises thecomplete genome with the exception of the 5′ untranslated region and 3′untranslated region.

[0037] The term “HCV polyprotein” refers to the HCV polyprotein of theHCV-J isolate (Kato et al., 1990). The adenine residue at position 330(Kato et al., 1990) is the first residue of the ATG codon that initiatesthe long HCV polyprotein of 3010 amino acids in HCV-J and other type 1 bisolates, and of 3011 amino acids in HCV-1 and other type 1 a isolates,and of 3033 amino acids in type 2 isolates HC-J6 and HC-J8 (Okamoto etal., 1992).

[0038] This adenine is designated as position 1 at the nucleic acidlevel, and this methionine is designated as position 1 at the amino acidlevel, in the present invention. As type 1 a isolates contain 1 extraamino acid in the NS5a region, coding sequences of type 1 a and 1 b haveidentical numbering in the Core, E1, NS3, and NS4 region, but willdiffer in the NS5b region as indicated in Table 1. Type 2 isolates have4 extra amino acids in the E2 region, and 17 or 18 extra amino acids inthe NS5 region compared to type 1 isolates, and will differ in numberingfrom type 1 isolates in the NS3/4 region and NS5b regions as indicatedin Table 1. TABLE 1 Positions Positions Positions Positions described indescribed for described for described for the HCV-J HCV-1 HC-J6, HC-J8present (Kato et al., (Choo et al., (Okamoto et Region invention* 1990)1991) al., 1992) Nucleotides NS5b 8023/8235 8352/8564 8026/82388433/8645 7932/8271 8261/8600 7935/8274 8342/8681 NS3/4 4664/52924993/5621 4664/5292 5017/5645 4664/4730 4993/5059 4664/4730 5017/50834892/5292 5221/5621 4892/5292 5245/5645 3856/4209 4185/4528 3856/42094209/4762 4936/5292 5265/5621 4936/5292 5289/5645 coding  330/9359  1/9033  342/9439 region of present invention Amino NS5b 2675/27452675/2745 2676/2746 2698/2768 Acids 2645/2757 2645/2757 2646/27582668/2780 NS3/4 1556/1764 1556/1764 1556/1764 1560/1768 1286/14031286/1403 1286/1403 1290/1407 1646/1764 1646/1764 1646/1764 1650/1768

[0039] Table 1: Comparison of the HCV nucleotide and amino acidnumbering system used in the present invention (*) with the numberingused for other prototype isolates. For example, 8352/8564 indicates theregion designated by the numbering from nucleotide 8352 to nucleotide8564 as described by Kato et al. (1990). Since the numbering system ofthe present invention starts at the polyprotein initiation site, the 329nucleotides of the 5′ untranslated region described by Kato et al (1990)have to be substracted, and the corresponding region is numbered fromnucleotide 8023 (“8352-329”) to 8235 (“8564-329”)

[0040] The term “HCV type” corresponds to a group of HCV isolates ofwhich the complete genome shows more than 74% homology at the nucleicacid level, or of which the NS5 region between nucleotide positions 7932and 8271 shows more than 74% homology at the nucleic acid level, or ofwhich the complete HCV polyprotein shows more than 78% homology at theamino acid level, or of which the NS5 region between amino acids atpositions 2645 and 2757 shows more than 80% homology at the amino acidlevel, to polyproteins of the other isolates of the group, with saidnumbering beginning at the first ATG codon or first methionine of thelong HCV polyprotein of the HCV-J isolate (Kato et al., 1990). Isolatesbelonging to different types of HCV exhibit homologies, over thecomplete genome, of less than 74% at the nucleic acid level and lessthan 78% at the amino acid level. Isolates belonging to the same typeusually show homologies of about 92 to 95% at the nucleic acid level and95 to 96% at the amino acid level when belonging to the same subtype,and those belonging to the same type but different subtypes preferablyshow homologies of about 79% at the nucleic acid level and 85-86% at theamino acid level.

[0041] More preferably the definition of HCV types is concluded from theclassification of HCV isolates according to their nucleotide distancescalculated as detailed below

[0042] (1) based on phylogenetic analysis of nucleic acid sequences inthe NS5b region between nucleotides 7935 and 8274 (Choo et al., 1991) or8261 and 8600 (Kato et al., 1990) or 8342 and 8681 (Okamoto et al.,1991), isolates belonging to the same HCV type show nucleotide distancesof less than 0.34, usually less than 0.33, and more usually of less than0.32, and isolates belonging to the same subtype show nucleotidedistances of less than 0.135, usually of less than 0.13, and moreusually of less than 0.125, and consequently isolates belonging to thesame type but different subtypes show nucleotide distances ranging from0 135 to 0.34, usually ranging from 0.1384 to 0.2477, and more usuallyranging from 0.15 to 0.32, and isolates belonging to different HCV typesshow nucleotide distances greater than 0.34, usually greater that 0.35,and more usually of greater than 0.358, more usually ranging from 0 1384to 0.2977.

[0043] (2) based on phylogenetic analysis of nucleic acid sequences inthe core/E1 region between nucleotides 378 and 957, isolates belongingto the same HCV type show nucleotide distances of less than 0.38,usually of less than 0.37, and more usually of less than 0.364, andisolates belonging to the same subtype show nucleotide distances of lessthan 0.17, usually of less than 0.16, and more usually of less than0.15, more usually less than 0.135, more usually less than 0.134, andconsequently isolates belonging to the same type but different subtypesshow nucleotide distances ranging from 0.15 to 0.38, usually rangingfrom 0.16 to 0.37, and more usually ranging from 0.17 to 0.36, moreusually ranging from 0.133 to 0.379, and isolates belonging to differentHCV types show nucleotide distances greater than 0.34, 0.35, 0.36,usually more than 0.365, and more usually of greater than 0.37,

[0044] (3) based on phylogenetic analysis of nucleic acid sequences inthe NS3/NS4 region between nucleotides 4664 and 5292 (Choo et al., 1991)or between nucleotides 4993 and 5621 (Kato et al., 1990) or betweennucleotides 5017 and 5645 (Okamoto et al., 1991), isolates belonging tothe same HCV type show nucleotide distances of less than 0.35, usuallyof less than 0.34, and more usually of less than 0.33, and isolatesbelonging to the same subtype show nucleotide distances of less than0.19, usually of less than 0.18, and more usually of less than 0.17, andconsequently isolates belonging to the same type but different subtypesshow nucleotide distances ranging from 0.17 to 0.35, usually rangingfrom 0.18 to 0 34, and more usually ranging from 0.19 to 0.33, andisolates belonging to different HCV types show nucleotide distancesgreater than 0.33, usually greater than 0.34, and more usually ofgreater than 0.35. TABLE 2 Molecular evolutionary distances Core/E1 E1NS5B NS5B Region 579 bp 384 bp 340 bp 222 bp Isolates* 0.0017 − 0.13470.0026 − 0.2031 0.0003 − 0.1151  0.000 − 0.1323 (0.0750 ± 0.0245)(0.0969 ± 0.0289) (0.0637 ± 0.0229) (0.0607 ± 0.0205) Subtypes* 0.1330 −0.3794 0.1645 − 0.4869 0.1384 − 0.2977  0.117 − 0.3538 (0.2786 ± 0.0363)(0.3761 ± 0.0433) (0.2219 ± 0.0341) (0.2391 ± 0.0399) Types* 0.3479 −0.6306 0.4309 − 0.9561 0.3581 − 0.6670 0.3457 − 0.7471 (0.4703 ± 0.0525)(0.6308 ± 0.0928) (0.4994 ± 0.0495) (0.5295 ± 0.0627)

[0045] In a comparative phylogenetic analysis of available sequences,ranges of molecular evolutionary distances for different regions of thegenome were calculated, based on 19,781 pairwise comparisons by means ofthe DNA DIST program of the phylogeny inference package PHYLIP version3.5C (Felsenstein, 1993). The results are shown in Table 2 and indicatethat although the majority of distances obtained in each region fit withclassification of a certain isolate, only the ranges obtained in the 340bp NS5B-region are non-overlapping and therefor conclusive. However, aswas performed in the present invention, it is preferable to obtainsequence information from at least 2 regions before final classificationof a given isolate.

[0046] Designation of a number to the different types of HCV and HCVtypes nomenclature is based on chronological discovery of the differenttypes. The numbering system used in the present invention might stillfluctuate according to international conventions or guidelines. Forexample, “type 4” might be changed into “type 5” or “type 6”.

[0047] The term “subtype” corresponds to a group of HCV isolates ofwhich the complete polyprotein shows a homology of more than 90% both atthe nucleic acid and amino acid levels, or of which the NS5 regionbetween nucleotide positions 7932 and 8271 shows a homology of more than90% at the nucleic acid level to the corresponding parts of the genomesof the other isolates of the same group, with said numbering beginningwith the adenine residue of the initiation codon of the HCV polyprotein.Isolates belonging to the same type but different subtypes of HCV showhomologies of more than 74% at the nucleic acid level and of more than78% at the amino acid level.

[0048] The term “BR36 subgroup” refers to a group of type 3 a HCVisolates (BR36, BR33, BR34) that are 95%, preferably 95.5%, mostpreferably 96% homologous to the sequences as represented in SEQ ID NO1, 3, 5, 7, 9, 11 in the NS5b region from position 8023 to 8235.

[0049] It is to be understood that extremely variable regions like theE1, E2 and NS4 regions will exhibit lower homologies than the averagehomology of the complete genome of the polyprotein.

[0050] Using these criteria, HCV isolates can be classified into atleast 6 types. Several subtypes can clearly be distinguished in types 1,2, 3 and 4: 1 a, 1 b, 2 a, 2 b, 2 c, 2 d, 3 a, 3 b, 4 a, 4 b, 4 c, 4 d,4 e, 4 f, 4 g, 4 h, 4 i and 4 j based on homologies of the 5′ UR andcoding regions including the part of NS5 between positions 7932 and8271. An overview of most of the reported isolates and their proposedclassification according to the typing system of the present inventionas well as other proposed classifications is presented in Table 3. TABLE3 HCV CLASSIFICATION OKA- NAKA MOTO MORI O CHA PROTOTYPE 1a I I Pt GIHCV-1, HCV-H, HC-J1 1b II II KI GIII HCV-J, HCV-BK, HCV-T, HC-JK1, HC-J4, HCV-CHINA 1c HC-G9 2a III III K2a GIII HC-J6 2b IV IV K2b GIII HC-J82c S83, ARG6, ARG8, I10, T983 2d NE92 3a V V K3 GIV E-b1, Ta, BR36,BR33, HD10, NZL1 3b VI K3 GIV HCV-TR, Tb 3c BE98 4a Z4, GB809-4 4b Z1 4cGB116, GB358, GB215, Z6, Z7 4d DK13 4e GB809-2, CAM600, CAM736 4fCAM622, CAM627 4g GB549 4h GB438 4i CAR4/1205 4j CAR1/501 4k EG29 5a GVSA3, SA4, SA1, SA7, SA11, BE95 6a HK1, HK2, HK3, HK4

[0051] The term “complement” refers to a nucleotide sequence which iscomplementary to an indicated sequence and which is able to hybridize tothe indicated sequences.

[0052] The composition of the invention can comprise many combinations.By way of example, the composition of the invention can comprise:

[0053] two (or more) nucleic acids from the same region or,

[0054] two nucleic acids (or more), respectively from different regions,for the same isolate or for different isolates,

[0055] or nucleic acids from the same regions and from at least twodifferent regions (for the same isolate or for different isolates).

[0056] The present invention relates more particularly to a polynucleicacid composition as defined above, wherein said polynucleic acidcorresponds to a nucleotide sequence selected from any of the followingHCV type 3 genomic sequences:

[0057] an HCV genomic sequence having a homology of at least 67%,preferably more than 69%, more preferably 71%, even more preferably morethan 73%, or most preferably more than 76% to any of the sequences asrepresented in SEQ ID NO 13, 15, 17, 19, 21, 23, 25 or 27 (HD10, BR36 orBR33 sequences) in the region spanning positions 417 to 957 of theCore/E1 region as shown in FIG. 4;

[0058] an HCV genomic sequence having a homology of at least 65%,preferably more than 67%, preferably more than 69%, even preferably morethan 70%, most preferably more than 74% to any of the sequences asrepresented in SEQ ID NO 13, 15, 17, 19, 21, 23, 25 or 27 (HD10, BR36 orBR33 sequences) in the region spanning positions 574 to 957 of the E1region as shown in FIG. 4;

[0059] an HCV genomic sequence as having a homology of at least 79%,more preferably at least 81%, most preferably more than 83% or more toany of the sequences as represented in SEQ ID NO 147 (representingpositions 1 to 346 of the Core region of HVC type 3 c, sequence BE98) inthe region spanning positions 1 to 378 of the Core region as shown inFIG. 3;

[0060] an HCV genomic sequence of HVC type 3 a having a homology of atleast 74%, more preferably at least 76%, most preferably more than 78%or more to any of the sequences as represented in SEQ ID NO 13, 15, 17,19, 21, 23, 25 or 27 (HD10, BR36 or BR33 sequences) in the regionspanning positions 417 to 957 in the Core/E1 region as shown in FIG. 4;

[0061] an HCV genomic sequence of HCV type 3 a as having a homology ofat least 74%, preferably more than 76%, most preferably 78% or more toany of the sequences as represented in SEQ ID NO 13, 15, 17, 19, 21, 23,25 or 27 (HD10, BR36 or BR33 sequences) in the region spanning positions574 to 957 in the E1 region as shown in FIG. 4;

[0062] an HCV genomic sequence as having a homology of more than 73.5%,preferably more than 74%, most preferably 75% homology to the sequenceas represented in SEQ ID NO 29 (HCC153 sequence) in the region spanningpositions 4664 to 4730 of the NS3 region as shown in FIG. 6;

[0063] an HCV genomic sequence having a homology of more than 70%,preferably more than 72%, most preferably more than 74% homology to anyof the sequences as represented in SEQ ID NO 29, 31, 33, 35, 37 or 39(HCC153, HD10, BR36 sequences) in the region spanning positions 4892 to5292 in the NS3/NS4 region as shown in FIG. 6 or 10;

[0064] an HCV genomic sequence of the BR36 subgroup of HCV type 3 a ashaving a homology of more than 95%, preferably 95,5%, most preferably96% homology to any of the sequences as represented in SEQ ID NO 5, 7,1, 3, 9 or 11 (BR34, BR33, BR36 sequences) in the region spanningpositions 8023 to 8235 of the NS5 region as shown in FIG. 1;

[0065] an HCV genomic sequence of the BR36 subgroup of HCV type 3 a ashaving a homology of more than 96%, preferably 96.5%, most preferably97% homology to any of the sequences as represented in SEQ ID NO 5, 7,1, 3, 9 or 11 (BR34, BR33, BR36 sequences) in the region spanningpositions 8023 to 8192 of the NSSB region as shown in FIG. 1;

[0066] an HCV genomic sequence of HCV type 3 c being characterized ashaving a homology of more than 79%, more preferably more than 81%, andmost preferably more than 83% to the sequence as represented in SEQ IDNO 149 (BE98 sequence) in the region spanning positions 7932 to 8271 inthe NS5B region as shown in FIG. 1.

[0067] Preferentially the above-mentioned genomic HCV sequences depictsequences from the coding regions of all the above-mentioned sequences.

[0068] According to the nucleotide distance classification system (withsaid nucleotide distances being calculated as explained above), saidsequences of said composition are selected from:

[0069] an HCV genomic sequence being characterized as having anucleotide distance of less than 0.44, preferably of less than 0.40,most preferably of less than 0.36 to any of the sequences as representedin SEQ ID NO 13, 15, 17, 19, 21, 23, 25 or 27 in the region spanningpositions 417 to 957 of the Core/E1 region as shown in FIG. 4;

[0070] an HCV genomic sequence being characterized having a nucleotidedistance of less than 0.53, preferably less than 0.49, most preferablyof less than 0.45 to any of the sequences as represented in SEQ ID NO19, 21, 23, 25 or 27 in the region spanning positions 574 to 957 of theE1 region as shown in FIG. 4;

[0071] an HCV genomic sequence characterized having a nucleotidedistance of less than 0.15, preferably less than 0.13, and mostpreferably less than 0.11 to any of the sequences as represented in SEQID NO 147 in the region spanning positions 1 to 378 of the Core regionas shown in FIG. 3;

[0072] an HCV genomic sequence of HVC type 3 a being characterized ashaving a nucleotide distance of less than 0.3, preferably less than0.26, most preferably of less than 0.22 to any of the sequences asrepresented in SEQ ID NO 13, 15, 17, 19, 21, 23, 25 or 27 in the regionspanning positions 417 to 957 in the Core/E1 region as shown in FIG. 4;

[0073] an HCV genomic sequence of HCV type 3 a being characterized ashaving a nucleotide distance of less than 0.35, preferably less than0.31, most preferably of less than 0.27 to any of the sequences asrepresented in SEQ ID NO 13, 15, 17, 19, 21, 23, 25 or 27 in the regionspanning positions 574 to 957 in the E1 region as shown in FIG. 4;

[0074] an HCV genomic sequence of the BR36 subgroup of HCV type 3 abeing characterized as having a nucleotide sequence of less than 0.0423,preferably less than 0.042, preferably less than 0.0362 to any of thesequences as represented in SEQ ID NO 5, 7, 1, 3, 9 or 11 in the regionspanning positions 8023 to 8235 of the NS5 region as shown in FIG. 1;

[0075] an HCV genomic sequence of HCV type 3 c being characterized ashaving a nucleotide distance of less than 0.255, preferably of less than0.25, more preferably of less than 0.21, most preferably of less than0.17 to the sequence as represented in SEQ ID NO 149 in the regionspanning positions 7932 to 8271 in the NS5B region as shown in FIG. 1.

[0076] In the present application, the E1 sequences encoding theantigenic ectodomain of the E1 protein, which does not overlap thecarboxyterminal signal-anchor sequences of E1 disclosed by Cha et al.(1992; WO 92/19743), in addition to the NS4 epitope region, and a partof the NS5 region are disclosed for 4 different isolates: BR33, BR34,BR36, HCC153 and HD10, all belonging to type 3 a (SEQ ID NO 1, 3, 5, 7,9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 35, 37 or 39).

[0077] Also within the present invention are new subtype 3 c sequences(SEQ ID NO 147, 149 of the isolate BE98 in the Core and NS5 regions (seeFIGS. 3 and 1).

[0078] Finally the present invention also relates to a new subtype 3 asequence as represented in SEQ ID NO 217 (see FIG. 1).

[0079] Also included within the present invention are sequence variantsof the polynucleic acids as selected from any of the nucleotidesequences as given in any of the above mentioned SEQ ID numbers, withsaid sequence variants containing either deletions and/or insertions ofone or more nucleotides, mainly at the extremities of oligonucleotides(either 3′ or 5′), or substitutions of some non-essential nucleotides byothers (including modified nucleotides an/or inosine), for example, atype 1 or 2 sequence might be modified into a type 3 sequence byreplacing some nucleotides of the type 1 or 2 sequence withtype-specific nucleotides of type 3 as shown in FIG. 1 (NS5 region),FIG. 3 (Core region), FIG. 4 (Core/E1 region), FIG. 6 and 10 (NS3/NS4region).

[0080] According to another embodiment, the present invention relates toa polynucleic acid composition as defined above, wherein saidpolynucleic acids correspond to a nucleotide sequence selected from anyof the following HCV type 5 genomic sequences:

[0081] an HCV genomic sequence as having a homology of more than 85%,preferably more than 86%, most preferably more than 87% homology to anyof the sequences as represented in SEQ ID NO 41, 43, 45, 47, 49, 51, 53(PC sequences) or 151 (BE95 sequence) in the region spanning positions 1to 573 of the Core region as shown in FIG. 9 and 3;

[0082] an HCV genomic sequence as having a homology of more than 61%,preferably more than 63%, more preferably more than 65% homology, evenmore preferably more than 66% homology and most preferably more than 67%homology (f.i. 69 and 71%) to any of the sequences as represented in SEQID NO 41, 43, 45, 47, 49, 51, 53 (PC sequences), 153 or 155 (BE95, BE100sequences) in the region spanning positions 574 to 957 of the E1 regionas shown in FIG. 4;

[0083] an HCV genomic sequence having a homology of more than 76.5%,preferably of more than 77%, most preferably of more than 78% homologywith any of the sequences as represented in SEQ ID NO 55, 57, 197 or 199(PC sequences) in the region spanning positions 3856 to 4209 of the NS3region as shown in FIG. 6 or 10;

[0084] an HCV genomic sequence having a homology of more than 68%,preferably of more than 70%, most preferably of more than 72% homologywith the sequence as represented in SEQ ID NO 157 (BE95 sequence) in theregion spanning positions 980 to 1179 of the E1/E2 region as shown inFIG. 13;

[0085] an HCV genomic sequence having a homology of more than 57%,preferably more than 59%, most preferably more than 61% homology to anyof the sequences as represented in SEQ ID NO 59 or 61 (PC sequences) inthe region spanning positions 4936 to 5296 of the NS4 region as shown inFIG. 6 or 10;

[0086] an HCV genomic sequence as having a homology of more than 93%,preferably more than 93.5%, most preferably more than 94% homology toany of the sequences as represented in SEQ ID NO 159 or 161 (BE95 orBE96 sequences) in the region spanning positions 7932 to 8271 of theNS5B region as shown in FIG. 1.

[0087] Preferentially the above-mentioned genomic HCV sequences depictsequences from the coding regions of all the above-mentioned sequences.

[0088] According to the nucleotide distance classification system (withsaid nucleotide distances being calculated as explained above), saidsequences of said composition are selected from:

[0089] a nucleotide distance of less than 0.53, preferably less than0.51, more preferably less than 0.49 for the E1 region to the type 5sequences depicted above;

[0090] a nucleotide distance of less than 0.3, preferably less than0.28, more preferably of less than 0.26 for the Core region to the type5 sequences depicted above;

[0091] a nucleotide distance of less than 0.072, preferably less than0.071, more preferably less than 0.070 for the NS5B region to the type 5sequences as depicted above.

[0092] Isolates with similar sequences in the 5′UR to a group ofisolates including SA1, SA3, and SA7 described in the 5′UR by Bukh etal. (1992), have been reported and described in the 5′UR and NS5 regionas group V by Cha et al. (1992; WO 92/19743). This group of isolatesbelongs to type 5 a as described in the present invention (SEQ ID NO 41,43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 151, 153, 155, 157, 159, 161,197 and 199).

[0093] Also included within the present invention are sequence variantsof the polynucleic acids as selected from any of the nucleotidesequences as given in any of the above given SEQ ID numbers with saidsequence variants containing either deletion and/or insertions of one ormore nucleotides, mainly at the extremities of oligonucleotides (either3′ or 5′), or substitutions of some non-essential nucleotides (i.e.nucleotides not essential to discriminate between different genotypes ofHCV) by others (including modified nucleotides an/or inosine), forexample, a type 1 or 2 sequence might be modified into a type 5 sequenceby replacing some nucleotides of the type 1 or 2 sequence withtype-specific nucleotides of type 5 as shown in FIG. 3 (Core region),FIG. 4 (Core/E1 region), FIG. 10 (NS3/NS4 region), FIG. 14 (E1/E2region).

[0094] Another group of isolates including BU74 and BU79 having similarsequences in the 5′UR to isolates including Z6 and Z7 as described inthe 5′UR by Bukh et al. (1992), have been described in the 5′UR andclassified as a new type 4 by the inventors of this application (Stuyveret al., 1993). Coding sequences, including core, E1 and NS5 sequences ofseveral new Gabonese isolates belonging to this group, are disclosed inthe present invention (SEQ ID NO 106, 108, 110, 112, 114, 116, 118, 120and 122).

[0095] According to yet another embodiment, the present inventionrelates to a composition as defined above, wherein said polynucleicacids correspond to a nucleotide sequence selected from any of thefollowing HCV type 4 genomic sequences:

[0096] an HCV genomic sequence having a homology of more than 66%,preferably more than 68%, most preferably more than 70% homology in theE1 region spanning positions 574 to 957 to any of the sequences asrepresented in SEQ ID NO 118, 120 or 122 (GB358, GB549, GB809 sequences)as shown in FIG. 4;

[0097] an HCV genomic sequence having a homology of more than 71%,preferably more than 72%, most preferably more than 74% homology to anyof the sequences as represented in SEQ ID NO 118, 120 or 122 (GB358,GB549, GB809 sequences) in the region spanning positions 379 to 957 ofthe E1 region as shown in FIG. 4;

[0098] an HCV genomic sequence having a homology of more than 92%,preferably more than 93%, most preferably more than 94% homology to anyof the sequences as represented in SEQ ID NO 163 or 165 (GB809, CAM600sequences) in the region spanning positions 1 to 378 of the Core/E1region as shown in FIG. 4;

[0099] an HCV genomic sequence (subtype 4 c) having a homology of morethan 85%, preferably more than 86%, more preferably more than 86.5%homology, most preferably more than 87, more than 88 or more than 89%homology to any of the sequences as represented in SEQ ID NO 183, 185 or187 (GB116, GB215, GB809 sequences) in the region spanning positions 379to 957 of the E1 region as shown in FIG. 4;

[0100] an HCV genomic sequence (subtype 4 a) having a homology of morethan 81%, preferably more than 83%, most preferably more than 85%homology to the sequence as represented in SEQ ID NO 189 (GB908sequence) in the region spanning positions 379 to 957 of the E1 regionas shown in FIG. 4;

[0101] an HCV genomic sequence (subtype 4 e) having a homology of morethan 85%, preferably more than 87%, most preferably more than 89%homology to any of the sequences as represented in SEQ ID NO 167 or 169(CAM600, GB908 sequences) in the region spanning positions 379 to 957 ofthe E1 region as shown in FIG. 4;

[0102] an HCV genomic sequence (subtype 4 f) having a homology of morethan 79%, preferably more than 81%, most preferably more than 83%homology to any of the sequences as represented in SEQ ID NO 171 or 173(CAMG22, CAMG27 sequences) in the region spanning positions 379 to 957of the E1 region as shown in FIG. 4;

[0103] an HCV genomic sequence (subtype 4 g) having a homology of morethan 84%, preferably more than 86%, most preferably more than 88%homology to the sequence as represented in SEQ ID NO 175 (GB549sequence) in the region spanning positions 379 to 957 of the E1 regionas shown in FIG. 4;

[0104] an HCV genomic sequence (subtype 4 h) having a homology of morethan 83%, preferably more than 85%, most preferably more than 87%homology to the sequence as represented in SEQ ID NO 177 (GB438sequence) in the region spanning positions 379 to 957 of the E1 regionas shown in FIG. 4;

[0105] an HCV genomic sequence (subtype 4 i) as having a homology ofmore than 76%, preferably more than 78%, most preferably more than 80%homology to the sequence as represented in SEQ ID NO 179 (CAR4/1205sequence) in the region spanning positions 379 to 957 of the E1 regionas shown in FIG. 4;

[0106] an HCV genomic sequence (subtype 4 j?) having a homology of morethan 84%, preferably more than 86%, most preferably more than 88%homology to the sequence as represented in SEQ ID NO 181 (CAR4/901sequence) in the region spanning positions 379 to 957 of the E1 regionas shown in FIG. 4;

[0107] an HCV genomic sequence as having a homology of more than 73%,preferably more than 75%, most preferably more than 77% homology to anyof the sequences as represented in SEQ ID NO 106, 108, 110, 112, 114, or116 (GB48, GB116, GB215, GB358, GB549, GB809 sequences) in the regionspanning positions 7932 to 8271 of the NS5 region as shown in FIG. 1;

[0108] an HCV genomic sequence (subtype 4 c) having a homology of morethan 88%, preferably more than 89%, most preferably more than 90%homology to any of the sequences as represented in SEQ ID NO 106, 108,110, or 112 (GB48, GB116, GB215, GB358 sequences) in the region spanningpositions 7932 to 8271 of the NS5 region as shown in FIG. 1;

[0109] an HCV genomic sequence (subtype 4 e) having a homology of morethan 88%, preferably more than 89%, most preferably more than 90%homology to any of the sequences as represented in SEQ ID NO 116 or 201(GB809 or CAM 600 sequences) in the region spanning positions 7932 to8271 of the NS5 region as shown in FIG. 1;

[0110] an HCV genomic sequence (subtype 4 f) having a homology of morethan 87%, preferably more than 89%, most preferably more than 90%homology to the sequence as represented in SEQ ID NO 203 (CAMG22sequence) in the region spanning positions 7932 to 8271 of the NS5region as shown in FIG. 1;

[0111] an HCV genomic sequence (subtype 4 g) as having a homology ofmore than 85%, preferably more than 87%, most preferably more than 89%homology to the sequence as represented in SEQ ID NO 114 (GB549sequence) in the region spanning positions 7932 to 8271 of the NS5region as shown in FIG. 1;

[0112] an HCV genomic sequence (subtype 4 h) as having a homology ofmore than 86%, preferably more than 87%, more preferably more than 88%homology, more preferably more than 89% homology to the sequence asrepresented in SEQ ID NO 207 (GB437 sequence) in the region spanningpositions 7932 to 8271 of the NS5 region as shown in FIG. 1;

[0113] an HCV genomic sequence (subtype 4 i) having a homology of morethan 84%, preferably more than 86%, most preferably more than 88%homology to the sequence as represented in SEQ ID NO 209 (CAR4/1205sequence) in the region spanning positions 7932 to 8271 of the NS5region as shown in FIG. 1;

[0114] an HCV genomic sequence (subtype 4 j) having a homology of morethan 81%, preferably more than 83%, most preferably more than 85%homology to the sequence as represented in SEQ ID NO 211 (CAR1/501sequence) in the region spanning positions 7932 to 8271 of the NS5region as shown in FIG. 1.

[0115] Preferentially the above-mentioned genomic HCV sequences depictsequences from the coding regions of all the above-mentioned sequences.

[0116] According to the nucleotide distance classification system (withsaid nucleotide distances being calculated as explained above), saidsequences of said composition are selected from:

[0117] an HCV genomic sequence (type 4) being characterized as having anucleotide distance of less than 0.52, 0.50, 0.4880, 0.46, 0.44, 0.43 ormost preferably less than 0.42 in the region spanning positions 574 to957 to any of the sequences as represented in SEQ ID NO 118, 120 or 122in the region spanning positions 1 to 957 of the Core/E1 region as shownin FIG. 4;

[0118] an HCV genomic sequence (type 4) being characterized as having anucleotide distance of less than 0.39, 0.36 0.34 0.32 or most preferablyless than 0.31 to any of the sequences as represented in SEQ ID NO 118,120 or 122 in the region spanning positions 379 to 957 of the E1 regionas shown in FIG. 4;

[0119] an HCV genomic sequence (subtype 4 c) being characterized ashaving a nucleotide distance of less than 0.27, 0.26, 0.24, 0.22, 0.20,0.18, 0.17, 0.162, 0.16 or most preferably less than 0.15 to any of thesequences as represented in SEQ ID NO 183, 185 or 187 in the regionspanning positions 379 to 957 of the E1 region as shown in FIG. 4;

[0120] an HCV genomic sequence (subtype 4 a) being characterized ashaving a nucleotide distance of less than 0.30, 0.28, 0.26, 0.24, 0.22,0.21 or most preferably of less than 0.205 to the sequence asrepresented in SEQ ID NO 189 in the region spanning positions 379 to 957of the E1 region as shown in FIG. 4;

[0121] an HCV genomic sequence (subtype 4 e) being characterized ashaving a nucleotide distance of less than 0.26, 0.25, 0.23, 0.21, 0.19,0.17, 0.165, most preferably less than 0.16 to any of the sequences asrepresented in SEQ ID NO 167 or 169 in the region spanning positions 379to 957 of the E1 region as shown in FIG. 4;

[0122] an HCV genomic sequence (subtype 4 f) being characterized ashaving a nucleotide distance of less than 0.26, 0.24, 0.22, 0.20, 0.18,0.16, 0.15 or most preferably less than 0.14 to any of the sequences asrepresented in SEQ ID NO 171 or 173 in the region spanning positions 379to 957 of the E1 region as shown in FIG. 4;

[0123] an HCV genomic sequence (subtype 4 g) being characterized ashaving a nucleotide distance of less than 0.20, 0.19, 0.18, 0.17 or mostpreferably of less than 0.16 to the sequence as represented in SEQ ID NO175 in the region spanning positions 379 to 957 of the E1 region asshown in FIG. 4;

[0124] an HCV genomic sequence (subtype 4 h) being characterized ashaving a nucleotide distance of less than 0.20, 0.19, 0.18, 0.17 andmost preferably of less than 0.16 to the sequence as represented in SEQID NO 177 in the region spanning positions 379 to 957 of the E1 regionas shown in FIG. 4;

[0125] an HCV genomic sequence (subtype 4 i) being characterized ashaving a nucleotide distance of less than 0.27, 0.25, 0.23, 0.21 andpreferably less than 0.16 to the sequence as represented in SEQ ID NO179 in the region spanning positions 379 to 957 of the E1 region asshown in FIG. 4;

[0126] an HCV genomic sequence (subtype 4 j?) being characterized ashaving a nucleotide distance of less than 0.19, 0.18, 0.17, 0.165 andmost preferably of less than 0.16 to the sequence as represented in SEQID NO 181 in the region spanning positions 379 to 957 of the E1 regionas shown in FIG. 4;

[0127] an HCV genomic sequence (type 4) being characterized as having anucleotide distance of less than 0.35, 0.34, 0.32 and most preferably ofless than 0.30 to any of the sequences as represented in SEQ ID NO 106,108, 110, 112, 114, or 116 in the region spanning positions 7932 to 8271of the NS5 region as shown in FIG. 1;

[0128] an HCV genomic sequence (subtype 4 c) being characterized ashaving a nucleotide distance of less than 0.18, 0.16, 0.14, 0.135, 0.13,0.1275 or most preferably less than 0.125 to any of the sequences asrepresented in SEQ ID NO 106, 108, 110, or 112 in the region spanningpositions 7932 to 8271 of the NS5 region as shown in FIG. 1;

[0129] an HCV genomic sequence (subtype 4 e) being characterized ashaving a nucleotide distance of less than 0.15, 0.14, 0.135, 0.13 andmost preferably of less than 0.125 to any of the sequences asrepresented in SEQ ID NO 116 or 201 in the region spanning positions7932 to 8271 of the NS5 region as shown in FIG. 1;

[0130] an HCV genomic sequence (subtype 4 f) being characterized ashaving a nucleotide distance of less than 0.15, 0.14, 0.135, 0.13 ormost preferably less than 0.125 to the sequence as represented in SEQ IDNO 203 in the region spanning positions 7932 to 8271 of the NS5 regionas shown in FIG. 1;

[0131] an HCV genomic sequence (subtype 4 g) being characterized ashaving a nucleotide distance of less than 0.17, 0.16, 0.15, 0.14, 0.13or most preferably less than 0.125 to the sequence as represented in SEQID NO 114 in the region spanning positions 7932 to 8271 of the NS5region as shown in FIG. 1;

[0132] an HCV genomic sequence (subtype 4 h) being characterized ashaving a nucleotide distance of less than 0.155, 0.15, 0.145, 0.14,0.135, 0.13 or most preferably less than 0.125 to the sequence asrepresented in SEQ ID NO 207 in the region spanning positions 7932 to8271 of the NS5 region as shown in FIG. 1;

[0133] an HCV genomic sequence (subtype 4 i) being characterized ashaving a nucleotide distance of less than 0.17, 0.16, 0.15, 0.14, 0.13or most preferably of less than 0.125 to the sequence as represented inSEQ ID NO 209 in the region spanning positions 7932 to 8271 of the NS5region as shown in FIG. 1;

[0134] an HCV genomic sequence (subtype 4 j) being characterized ashaving a nucleotide distance of less than 0.21, 0.20, 0.19, 0.18, 0.17,0.16, 0.15, 0.14, 0.13 and most preferably of less than 0.125 to thesequence as represented in SEQ ID NO 211 in the region spanningpositions 7932 to 8271 of the NS5 region as shown in FIG. 1.

[0135] Also included within the present invention are sequence variantsof the polynucleic acids as selected from any of the nucleotidesequences as given in any of the above given SEQ ID numbers with saidsequence variants containing either deletion and/or insertions of one ormore nucleotides, mainly at the extremities of oligonucleotides (either3′ or 5′), or substitutions of some non-essential nucleotides (i.e.nucleotides not essential to discriminate between different genotypes ofHCV) by others (including modified nucleotides an/or inosine), forexample, a type 1 or 2 sequence might be modified into a type 4 sequenceby replacing some nucleotides of the type 1 or 2 sequence withtype-specific nucleotides of type 4 as shown in FIG. 3 (Core region),FIG. 4 (Core/E1 region), FIG. 10 (NS3/NS4 region), FIG. 14 (E1/E2region).

[0136] The present invention also relates to a sequence as representedin SEQ ID NO 193 (GB724 sequence).

[0137] After aligning NS5 or E1 sequences of GB48, GB, 116, GB215,GB358, GB549 and GB809, these isolates clearly segregated into 3subtypes within type 4: GB48, GB116, GB215 and GB358 belong to thesybtype designated 4 c, GB549 to subtype 4 g and GB809 to subtype 4 e.In NS5, GB809 (subtype 4 e) showed a higher nucleic acids homology tosubtype 4 c isolates (85.6-86.8%) than to GB549 (subtype 4 g, 79.7%),while GB549 showed similar homologies to both other subtypes (78.8 to80% to subtype 4 c and 79.7% to subtype 4 e). In E1, subtype 4 c showedequal nucleic acid homologies of 75.2% to subtypes 4g and 4e while 4 gand 4 e were 78.4% homologous. At the amino acid level however, subtype4 e showed a normal homology to subtype 4 c (80.2%), while subtype 4 gwas more homologous to 4 c (83.3%) and 4 e (84.1%).

[0138] According to yet another embodiment, the present inventionrelates to a composition as defined above, wherein said polynucleicacids correspond to a nucleotide sequence selected from any of thefollowing HCV type 2 d genomic sequences:

[0139] an HCV genomic sequence as having a homology of more than 78%,preferably more than 80%, most preferably more than 82% homology to thesequence as represented in SEQ ID NO (NE92) 143 in the region spanningpositions 379 to 957 of the Core/E1 region as shown in FIG. 4;

[0140] an HCV genomic sequence as having a homology of more than 74%,preferably more than 76%, most preferably more than 78% homology to thesequence as represented in SEQ ID NO 143 (NE92) in the region spanningpositions 574 to 957 as shown in FIG. 4;

[0141] an HCV genomic sequence as having a homology of more than 87%,preferably more than 89%, most preferably more than 91% homology to thesequence as represented in SEQ ID NO 145 (NE92) in the region spanningpositions 7932 to 8271 of the NS5B region as shown in FIG. 1.

[0142] Preferentially the above-mentioned genomic HCV sequences depictsequences from the coding regions of all the above-mentioned sequences.

[0143] According to the nucleotide distance classification system (withsaid nucleotide distances being calculated as explained above), saidsequences of said composition are selected from:

[0144] a nucleotide distance of less than 0.32, preferably less than0.31, more preferably less than 0.30 for the E1 region (574 to 957) toany of the above specified sequences;

[0145] a nucleotide distance of less than 0.08, preferably less than0.07, more preferably less than 0.06 for the Core region (1 to 378) toany of the above given sequences

[0146] a nucleotide distance of less than 0.15, preferantially less than0.13, more preferentially less than 0.12 for the NS5B region to any ofthe above-specified sequences.

[0147] Polynucleic acid sequences according to the present inventionwhich are homologous to the sequences as represented by a SEQ ID NO canbe characterized and isolated according to any of the techniques knownin the art, such as amplification by means of type or subtype specificprimers, hybridization with type or subtype specific probes under moreor less stringent conditions, serological screening methods (seeexamples 4 and 11) or via the LiPA typing system.

[0148] Polynucleic acid sequences of the genomes indicated above fromregions not yet depicted in the present examples, figures and sequencelisting can be obtained by any of the techniques known in the art, suchas amplification techniques using suitable primers from the type orsubtype specific sequences of the present invention.

[0149] The present invention relates also to a composition as definedabove, wherein said polynucleic acid is liable to act as a primer foramplifying the nucleic acid of a certain isolate belonging to thegenotype from which the primer is derived.

[0150] An example of a primer according to this embodiment of theinvention is HCPr 152 as shown in table 7 (SEQ ID NO 79).

[0151] The term “primer” refers to a single stranded DNA oligonucleotidesequence capable of acting as a point of initiation for synthesis of aprimer extension product which is complementary to the nucleic acidstrand to be copied. The length and the sequence of the primer must besuch that they allow to prime the synthesis of the extension products.Preferably the primer is about 5-50 nucleotides. Specific length andsequence will depend on the complexity of the required DNA or RNAtargets, as well as on the conditions of primer use such as temperatureand ionic strength.

[0152] The fact that amplification primers do not have to match exactlywith corresponding template sequence to warrant proper amplification isamply documented in the literature (Kwok et al., 1990).

[0153] The amplification method used can be either polymerase chainreaction (PCR; Saiki et al., 1988), ligase chain reaction (LCR; Landgrenet al., 1988; Wu & Wallace, 1989; Barany, 1991), nucleic acidsequence-based amplification (NASBA; Guatelli et al., 1990; Compton,1991), transcription-based amplification system (TAS; Kwoh et al.,1989), strand displacement amplification (SDA; Duck, 1990; Walker etal., 1992) or amplification by means of QB replicase (Lizardi et al.,1988; Lomeli et al., 1989) or any other suitable method to amplifynucleic acid molecules using primer extension. During amplification, theamplified products can be conveniently labelled either using labelledprimers or by incorporating labelled nucleotides. Labels may be isotopic(³²P, ³⁵S, etc.) or non-isotopic (biotin, digoxigenin, etc.). Theamplification reaction is repeated between 20 and 80 times,advantageously between 30 and 50 times.

[0154] The present invention also relates to a composition as definedabove, wherein said polynucleic acid is able to act as a hybridizationprobe for specific detection and/or classification into types of anucleic acid containing said nucleotide sequence, with saidoligonucleotide being possibly labelled or attached to a solidsubstrate.

[0155] The term “probe” refers to single stranded sequence-specificoligonucleotides which have a sequence which is complementary to thetarget sequence of the HCV genotype(s) to be detected.

[0156] Preferably, these probes are about 5 to 50 nucleotides long, morepreferably from about 10 to 25 nucleotides.

[0157] The term “solid support” can refer to any substrate to which anoligonucleotide probe can be coupled, provided that it retains itshybridization characteristics and provided that the background level ofhybridization remains low. Usually the solid substrate will be amicrotiter plate, a membrane (e.g. nylon or nitrocellulose) or amicrosphere (bead). Prior to application to the membrane or fixation itmay be convenient to modify the nucleic acid probe in order tofacilitate fixation or improve the hybridization efficiency. Suchmodifications may encompass homopolymer tailing, coupling with differentreactive groups such as aliphatic groups, NH₂ groups, SH groups,carboxylic groups, or coupling with biotin or haptens.

[0158] The present invention also relates to the use of a composition asdefined above for detecting the presence of one or more HCV genotypes,more particularly for detecting the presence of a nucleic acid of any ofthe HCV genotypes having a nucleotide sequence as defined above, presentin a biological sample liable to contain them, comprising at least thefollowing steps:

[0159] (i) possibly extracting sample nucleic acid,

[0160] (ii) possibly amplifying the nucleic acid with at least one ofthe primers as defined above or any other HCV subtype 2 d, HCV type 3,HCV type 4, HCV type 5 or universal HCV primer,

[0161] (iii) hybrizing the nucleic acids of the biological sample,possibly under denatured conditions, and with said nucleic acids beingpossibly labelled during or after amplification, at appropriateconditions with one or more probes as defined above, with said probesbeing preferably attached to a solid substrate,

[0162] (iv) washing at appropriate conditions,

[0163] (v) detecting the hybrids formed,

[0164] (vi) inferring the presence of one or more HCV genotypes presentfrom the observed hybridization pattern.

[0165] Preferably, this technique could be performed in the Core or NS5Bregion.

[0166] The term “nucleic acid” can also be referred to as analyte strandand corresponds to a single- or double-stranded nucleic acid molecule.This analyte strand is preferentially positive- or negative strandedRNA, cDNA or amplified cDNA.

[0167] The term “biological sample” refers to any biological sample(tissue or fluid) containing HCV nucleic acid sequences and refers moreparticularly to blood serum or plasma samples.

[0168] The term “HCV subtype 2 d primer” refers to a primer whichspecifically amplifies HCV subtype 2 d sequences present in a sample(see Examples section and figures).

[0169] The term “HCV type 3 primer” refers to a primer whichspecifically amplifies HCV type 3 sequences present in a sample (seeExamples section and figures).

[0170] The term “HCV type 4 primer” refers to a primer whichspecifically amplifies HCV type 4 genomes present in a sample.

[0171] The term “universal HCV primer” refers to oligonucleotidesequences complementary to any of the conserved regions of the HCVgenome.

[0172] The term “HCV type 5 primer” refers to a primer whichspecifically amplifies HCV type 5 genomes present in a sample. The term“universal HCV primer” refers to oligonucleotide sequences complementaryto any of the conserved regions of the HCV genome.

[0173] The expression “appropriate” hybridization and washing conditionsare to be understood as stringent and are generally known in the art(e.g. Maniatis et al., Molecular Cloning: A Laboratory Manual, New York,Cold Spring Harbor Laboratory, 1982).

[0174] However, according to the hybridization solution (SSC, SSPE,etc.), these probes should be hybridized at their appropriatetemperature in order to attain sufficient specificity.

[0175] The term “labelled” refers to the use of labelled nucleic acids.This may include the use of labelled nucleotides incorporated during thepolymerase step of the amplification such as illustrated by Saiki et al.(1988) or Bej et al. (1990) or labelled primers, or by any other methodknown to the person skilled in the art.

[0176] The process of the invention comprises the steps of contactingany of the probes as defined above, with one of the following elements:

[0177] either a biological sample in which the nucleic acids are madeavailable for hybridization,

[0178] or the purified nucleic acids contained in the biological sample

[0179] or a single copy derived from the purified nucleic acids,

[0180] or an amplified copy derived from the purified nucleic acids,with said elements or with said probes being attached to a solidsubstrate.

[0181] The expression “inferring the presence of one or more HCVgenotypes present from the observed hybridization pattern” refers to theidentification of the presence of HCV genomes in the sample by analyzingthe pattern of binding of a panel of oligonucleotide probes. Singleprobes may provide useful information concerning the presence or absenceof HCV genomes in a sample. On the other hand, the variation of the HCVgenomes is dispersed in nature, so rarely is any one probe able toidentify uniquely a specific HCV genome. Rather, the identity of an HCVgenotype may be inferred from the pattern of binding of a panel ofoligonucleotide probes, which are specific for (different) segments ofthe different HCV genomes. Depending on the choice of theseoligonucleotide probes, each known HCV genotype will correspond to aspecific hybridization pattern upon use of a specific combination ofprobes. Each HCV genotype will also be able to be discriminated from anyother HCV genotype amplified with the same primers depending on thechoice of the oligonucleotide probes. Comparison of the generatedpattern of positively hybridizing probes for a sample containing one ormore unkown HCV sequences to a scheme of expected hybridizationpatterns, allows one to clearly infer the HCV genotypes present in saidsample. The present invention thus relates to a method as defined above,wherein one or more hybridization probes are selected from any of SEQ IDNO 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35,37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59 or 61, 106, 108, 110,112, 114, 116, 118, 120, 122, 143, 145, 147, 149, 151, 153, 155, 157,159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185,187, 198, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213,215, 217, 222, 269 or sequence variants thereof, with said sequencevariants containing deletions and/or insertions of one or morenucleotides, mainly at their extremities (either 3′ or 5′), orsubstitutions of some non-essential nucleotides (i.e. nucleotides notessential to discriminate between genotypes) by others (includingmodified nucleotides or inosine), or with said variants consisting ofthe complement of any of the above-mentioned oligonucleotide probes, orwith said variants consisting of ribonucleotides instead ofdeoxyribonucleotides, all provided that said variant probes can becaused to hybridize with the same specificity as the oligonucleotideprobes from which they are derived.

[0182] In order to distinguish the amplified HCV genomes from eachother, the target polynucleic acids are hybridized to a set ofsequence-specific DNA probes targetting HCV genotypic regions located inthe HCV polynucleic acids.

[0183] Most of these probes target the most type-specific regions of HCVgenotypes, but some can be caused to hybridize to more than one HCVgenotype. According to the hybridization solution (SSC, SSPE, etc.),these probes should be stringently hybridized at their appropriatetemperature in order to attain sufficient specificity. However, byslightly modifying the DNA probes, either by adding or deleting one or afew nucleotides at their extremities (either 3′ or 5′), or substitutingsome non-essential nucleotides (i.e. nucleotides not essential todiscriminate between types) by others (including modified nucleotides orinosine) these probes or variants thereof can be caused to hybridizespecifically at the same hybridization conditions (i.e. the sametemperature and the same hybridization solution). Also changing theamount (concentration) of probe used may be beneficial to obtain morespecific hybridization results. It should be noted in this context, thatprobes of the same length, regardless of their GC content, willhybridize specifically at approximately the same temperature in TMAClsolutions (Jacobs et al., 1988).

[0184] Suitable assay methods for purposes of the present invention todetect hybrids formed between the oligonucleotide probes and the nucleicacid sequences in a sample may comprise any of the assay formats knownin the art, such as the conventional dot-blot format, sandwichhybridization or reverse hybridization. For example, the detection canbe accomplished using a dot blot format, the unlabelled amplified samplebeing bound to a membrane, the membrane being incorporated with at leastone labelled probe under suitable hybridization and wash conditions, andthe presence of bound probe being monitored.

[0185] An alternative and preferred method is a “reverse” dot-blotformat, in which the amplified sequence contains a label. In thisformat, the unlabelled oligonucleotide probes are bound to a solidsupport and exposed to the labelled sample under appropriate stringenthybridization and subsequent washing conditions. It is to be understoodthat also any other assay method which relies on the formation of ahybrid between the nucleic acids of the sample and the oligonucleotideprobes according to the present invention may be used.

[0186] According to an advantageous embodiment, the process of detectingone or more HCV genotypes contained in a biological sample comprises thesteps of contacting amplified HCV nucleic acid copies derived from thebiological sample, with oligonucleotide probes which have beenimmobilized as parallel lines on a solid support.

[0187] According to this advantageous method, the probes are immobilizedin a Line Probe Assay (LiPA) format. This is a reverse hybridizationformat (Saiki et al., 1989) using membrane strips onto which severaloligonucleotide probes (including negative or positive controloligonucleotides) can be conveniently applied as parallel lines.

[0188] The invention thus also relates to a solid support, preferably amembrane strip, carrying on its surface, one or more probes as definedabove, coupled to the support in the form of parallel lines.

[0189] The LiPA is a very rapid and user-friendly hybridization test.Results can be read 4 h. after the start of the amplification. Afteramplification during which usually a non-isotopic label is incorporatedin the amplified product, and alkaline denaturation, the amplifiedproduct is contacted with the probes on the membrane and thehybridization is carried out for about 1 to 1,5 h hybridized polynucleicacid is detected. From the hybridization pattern generated, the HCV typecan be deduced either visually, but preferably using dedicated software.The LiPA format is completely compatible with commercially availablescanning devices, thus rendering automatic interpretation of the resultsvery reliable. All those advantages make the LiPA format liable for theuse of HCV detection in a routine setting. The LiPA format should beparticularly advantageous for detecting the presence of different HCVgenotypes.

[0190] The present invention also relates to a method for detecting andidentifying novel HCV genotypes, different from the known HCV genomes,comprising the steps of:

[0191] determining to which HCV genotype the nucleotides present in abiological sample belong, according to the process as defined above,

[0192] in the case of observing a sample which does not generate ahybridization pattern compatible with those defined in Table 3,sequencing the portion of the HCV genome sequence corresponding to theaberrantly hybridizing probe of the new HCV genotype to be determined.

[0193] The present invention also relates to the use of a composition asdefined above, for detecting one or more genotypes of HCV present in abiological sample liable to contain them, comprising the steps of:

[0194] (i) possibly extracting sample nucleic acid,

[0195] (ii) amplifying the nucleic acid with at least one of the primersas defined above,

[0196] (iii) sequencing the amplified products

[0197] (iv) inferring the HCV genotypes present from the determinedsequences by comparison to all known HCV sequences.

[0198] The present invention also relates to a composition consisting ofor comprising at least one peptide or polypeptide comprising acontiguous sequence of at least 5 amino acids corresponding to acontiguous amino acid sequence encoded by at least one of the HCVgenomic sequences as defined above, having at least one amino aciddiffering from the corresponding region of known HCV (type 1 and/or type2 and/or type 3) polyprotein sequences as shown in Table 3, or muteinsthereof.

[0199] It is to be noted that, at the level of the amino acid sequence,an amino acid difference (with respect to known HCV amino acidsequences) is necessary, which means that the polypeptides of theinvention correspond to polynucleic acids having a nucleotide difference(with known HCV polynucleic acid sequences) involving an amino aciddifference.

[0200] The new amino acid sequences, as deduced from the disclosednucleotide sequences (see SEQ ID NO 1 to 62 and 106 to 123 and 143 to218, 223 and 270), show homologies of only 59.9 to 78% with prototypesequences of type 1 and 2 for the NS4 region, and of only 53.9 to 68.8%with prototype sequences of type 1 and 2 for the E1 region. As the NS4region is known to contain several epitopes, for example characterizedin patent application EP-A-0 489 968, and as the E1 protein is expectedto be subject to immune attack as part of the viral envelope andexpected to contain epitopes, the NS4 and E1 epitopes of the new type 3,4 and 5 isolates will consistently differ from the epitopes present intype 1 and 2 isolates. This is examplified by the type-specificity ofNS4 synthetic peptides as presented in example 4, and thetype-specificity of recombinant E1 proteins in example 11.

[0201] After aligning the new subtype 2 d, type 3, 4 and 5 (see SEQ IDNO 1 to 62 and 106 to 123 and 143 to 218, 223 and 270) amino acidsequences with the prototype sequences of type 1 a, 1 b, 2 a, and 2 b,type- and subtype-specific variable regions can be delineated aspresented in FIG. 5 and 7.

[0202] As to the muteins derived from the polypeptides of the invention,Table 4 gives an overview of the amino acid substitutions which could bethe basis of some of the muteins as defined above.

[0203] The peptides according to the present invention containpreferably at least 5 contiguous HCV amino acids, preferably however atleast 8 contiguous amino acids, at least 10 or at least 15 (for instanceat least 9, 11, 12, 13, 14, 20 or 25 amino acids) of the new HCVsequences of the invention. TABLE 4 Amino acids Synonymous groups Ser(S) Ser, Thr, Gly, Asn Arg (R) Arg, His, Lys, Glu, Gln Leu (L) Leu; Ile,Met, Phe, Val, Tyr Pro (P) Pro, Ala, Thr, Gly Thr (T) Thr, Pro, Ser,Ala, Gly, His, Gln Ala (A) Ala, Pro, Gly, Thr Val (V) Val, Met, Ile,Tyr, Phe, Leu, Val Gly (G) Gly, Ala, Thr, Pro, Ser Ile (I) Ile, Met,Leu, Phe, Val, Ile, Tyr Phe (F) Phe, Met, Tyr, Ile, Leu, Trp, Val Tyr(Y) Tyr, Phe, Trp, Met, Ile, Val, Leu Cys (C) Cys, Ser, Thr, Met His (H)His, Gln, Arg, Lys, Glu, Thr Gln (Q) Gln, Glu, His, Lys, Asn, Thr, ArgAsn (N) Asn, Asp, Ser, Gln Lys (K) Lys, Arg, Glu, Gln, His Asp (D) Asp,Asn, Glu, Gln Glu (E) Glu, Gln, Asp, Lys, Asn, His, Arg Met (M) Met,Ile, Leu, Phe, Val

[0204] The polypeptides of the invention, and particularly thefragments, can be prepared by classical chemical synthesis.

[0205] The synthesis can be carried out in homogeneous solution or insolid phase.

[0206] For instance, the synthesis technique in homogeneous solutionwhich can be used is the one described by Houbenweyl in the bookentitled “Methode der organischen chemie” (Method of organic chemistry)edited by E. Wunsh, vol. 15-1 et II. THIEME, Stuttgart 1974.

[0207] The polypeptides of the invention can also be prepared in solidphase according to the methods described by Atherton and Shepard intheir book entitled “Solid phase peptide synthesis” (IRL Press, Oxford,1989).

[0208] The polypeptides according to this invention can be prepared bymeans of recombinant DNA techniques as described by Maniatis et al.,Molecular Cloning: A Laboratory Manual, New York, Cold Spring HarborLaboratory, 1982).

[0209] The present invention relates particularly to a polypeptide orpeptide composition as defined above, wherein said contiguous sequencecontains in its sequence at least one of the following amino acidresidues: L7, Q43, M44, S60, R67, Q70, T71, A79, A87, N106, K115, A127,A190, S130, V134, G142, I144, E152, A157, V158, P165, S177 or Y177,I178, V180 or E180 or F182, R184, I186, H187, T189, A190, S191 or G191,Q192 or L192 or I192 or V192 or E192, N193 or H193 or P193, W194 orY194, H195, A197 or I197 or V197 or T197, V202, 1203 or L203, Q208,A210, V212, F214, T216, R217 or D217 or E217 or V217, H218 or N218, H219or V219 or L219, L227 or I227, M231 or E231 or Q231, T232 or D232 orA232 or K232, Q235 or I235, A237 or T237, I242, I246, S247, S248, V249,S250 or Y250, I251 or V251 or M251 or F251, D252, T254 or V254, L255 orV255, E256 or A256, M258 or F258 or V258, A260 or Q260 or S260, A261,T264 or Y264, M265, 1266 or A266, A267, G268 or T268, F271 or M271 orV271, I277, M280 or H280, I284 or A284 or L84, V274, V291, N292 or S292,R293 or I293 or Y293, Q294 or R94, L297 or I297 or Q297, A299 or K299 orQ299, N303 or T303, T308 or L308, T310 or F310 or A310 or D310 or V310,L313, G317 or Q317, L333, S351, A358, A359, A363, S364, A366, T369,L373, F376, Q386, I387, S392, I399, F402, I403, R405, D454, A461, A463,T464, K484, Q500, E501, S521, K522, H524, N528, S531, S532, V534, F536,F537, M539, I546, C1282, A1283, H1310, V1312, Q1321, P1368, V1372,V1373, K1405, Q1406, S1409, A1424, A1429, C1435, S1436, S1456, H1496,A1504, D1510, D1529, I1543, N1567, D1556, N1567, M1572, Q1579, L1581,S1583, F1585, V1595, E1606 or T1606, M1611, V1612 or L1612, P1630,C1636, P1651, T1656 or I1656, L1663, V1667, V1677, A1681, H1685, E1687,G1689, V1695, A1700, Q1704, Y1705, A1713, A1714 or S1714, M1718, D1719,A1721 or T1721, R1722, A1723 or V1723, H1726 or G1726, E1730, V1732,F1735, I1736, S1737, R1738, T1739, G1740, Q1741, K1742, Q1743, A1744,T1745, L1746, E1747 or K1747, I1749, A1750, T1751 or A1751, V1753,N1755, K1756, A1757, P1758, A1759, H1762, T1763, Y1764, P2645, A2647,K2650, K2653 or L2653, S2664, N2673, F2680, K2681, L2686, H2692, Q2695or L2695 or 12695, V2712, F2715, V2719 or Q2719, T2722, T2724, S2725,R2726, G2729, Y2735, H2739, I2748, G2746 or I2746, I2748, P2752 orK2752, P2754 or T2754, T2757 or P2757, with said notation being composedof a letter representing the amino acid residue by its one-letter code,and a number representing the amino acid numbering according to Kato etal., 1990 as shown in Table 1 (comparison with other isolates). See alsothe numbering in FIGS. 2, 5, 7, and 11 (alignment amino acid sequences).

[0210] Within the group of unique and new amino acid residues of thepresent invention, the following residues were found to be specific forthe following types of HCV according to the HCV classification systemused in the present invention:

[0211] Q208, R217, E231, I235, I246, T264, I266, A267, F271, K299,L2686, Q2719 which are specific for the HCV subtype 2 d sequences of thepresent invention as shown in FIG. 5 and 2;

[0212] Q43, S60, R67, F182, I186, H187, A190, S191, L192, W194, V202,L203, V219, Q231, D232, A237, T-254, M280, Q299, T303, L308, and/or L313which are specific for the Core/E1 region of HCV type 3 of the inventionas shown in FIG. 5;

[0213] D1556, Q1579, L1581, S1584, F1585, E1606, V1612, P1630, C1636,T1656, L1663, H1685, E1687, G1689, V1695, Y1705, A1713, A1714, A1721,V1723, H1726, R1738, Q1743, A1744, E1747, 11749, A1751, A1759 and/orH1762 which are specific for the NS3/4 region of HCV type 3 sequences ofthe invention as shown in FIG. 7;

[0214] K2665, D2666, R2670 which are specific for the NS5B region of HCVtype 3 of the invention as shown in FIG. 2;

[0215] L7, A79, A127, S130, E152, V158, S177 or Y177, V180 or E180,R184, T189, Q192 or E192 or I192, N193 or H193, I197 or V197, I203,A210, V212, E217, H218, H219, L227, A232, V249, I251 or M251, D252, L255or V255, E256, M258 or V258 or F258, A260 or Q260, M265, T268, V271,V274, M280, I284, N292 or S292, Q294, L297 or I297, T308, A310 or D310or V310 or T310, and G317 which are specific for the core/E1 region ofHCV type 4 sequences of the present invention as shown in FIG. 5;

[0216] P2645, K2650, K2653, G2656, V2658, T2668, N2673 or N2673, K2681,H2686, D2691, L2692, Q2695 or L2695 or I2695, Y2704, V2712, F2715,V2719, I2722, S2725, G2729, Y2735, G2746 or I2746, P2752 or K2752,Q2753, P2754 or T2754, T2757 or P2757 which are specific for the NS5Bregion of the HCV type 4 sequences of the present invention as shown inFIG. 2;

[0217] M44, Q70, A87, N106, K115, V137, G142, P165, I178, F251, A299,N303, Q317 which are specific for the Core/E1 region of the HCV type 4sequences of the present invention as shown in FIG. 5;

[0218] L333, S351, A358, A359, A363, S364, A366, T369, L373, F376, Q386,I387, S392, I399, F102, I403, R405, D454, A461, A463, T464, K484, Q500,E501, S521, K522, H524, N528, S532, V534, F537, M539, 1546 which arespecific for the E1/E2 region of the HCV type 5 sequences of the presentinvention as shown in FIG. 12;

[0219] C1282, A1283, V1312, Q1321, P1368, V1372, K1405, Q1406, S1409,A1424, A1429, C1435, S1436, S1456, H1496, A1504, D1510, D1529, I1543,N1567, M1572, V1595, T1606, M1611, L1612, I1656, V1667, A1681, A1700,A1713, S1714, M1718, D1719, T1721, R1722, A1723, G1726, F1735, I1736,S1737, T1739, G1740, K1742, T1745, L1746, K1747, A1750, V1753, N1755,A1757, D1758, T1763, and Y1764 which are specific for the NS3/NS4 regionof HCV type 5 sequences of the invention as shown in FIG. 7;

[0220] A2647, L2653, S2674, F2680, T2724, R2726, Y2730, H2739 which arespecific for the NS5B region of the HCV type 5 sequences of the presentinvention as shown in FIG. 2;

[0221] A256, P1631, V1677, Q1704, E1730, V1732, Q1741 and T1751 whichare specific for the HCV type 3 and 5 sequences of the present inventionas shown in FIG. 5 and 7;

[0222] T71, A157, I227, T237, T240, Y250, V251, S260, M271, T2673,T2722, I2748 which are specific for the HCV type 3 and 4 sequences ofthe present invention as shown in FIG. 5 and 2,

[0223] V192, Y194, A197, P249, S250, R294 which are specific for the HCVtype 4 and 5 sequences of the present invention as shown in FIG. 5;

[0224] I293 which is specific for the HCV type 4 and subtype 2 dsequence of the present invention as shown in FIG. 5;

[0225] D217 and R294 which are specific for the HCV type 3, 4 and 5sequences of the present invention as shown in FIG. 5;

[0226] L192 which is specific for the HCV type 3 and subtype 2 dsequences of the present invention as shown in FIG. 5;

[0227] G191 and T197 which are specific for the HCV type 3, 4 andsubtype 2 d sequences of the present invention as shown in FIG. 5;

[0228] K232 which is specific for the HCV subtype 2 d en type 5sequences of the present invention as shown in FIG. 5.

[0229] and with said notation being composed of a letter, unambiguouslyrepresenting the amino acid by its one-letter code, and a numberrepresenting the amino acid numbering according to Kato et al., 1990(see also Table 1 for comparison with other isolates), as well as FIG. 2(NS5 region), FIG. 5 (Core/E1 region), FIG. 7 (NS3/NS4 region), FIG. 12(E1/E2 region). Some of the above-mentioned amino acids may be containedin type or subtype specific epitopes.

[0230] For example M231 (detected in type 5) refers to a methionine atposition 231. A glutamine (Q) is present at the same position 231 intype 3 isolates, whereas this position is occupied by an arginine intype 1 isolates and by a lysine (K) or asparagine (N) in type 2 isolates(see FIG. 5).

[0231] The peptide or polypeptide according to this embodiment of theinvention may be possibly labelled, or attached to a solid substrate, orcoupled to a carrier molecule such as biotin, or mixed with a properadjuvant.

[0232] The variable region in the core protein (V-CORE in FIG. 5) hasbeen shown to be useful for serotyping (Machida et al., 1992). Thesequence of the disclosed type 5 sequence in this region showstype-specific features. The peptide from amino acid 70 to 78 shows thefollowing unique sequence for the sequences of the present inevntion(see FIG. 5): QPTGRSWGQ (SEQ ID NO 93) RSEGRTSWAQ (SEQ ID NO 220) andRTEGRTSWAQ (SEQ ID NO 221)

[0233] Another preferred V-Core spanning region is the peptide spanningpositions 60 to 78 of subtype 3 c with sequence:

[0234] SRRQPIPRARRTEGRSWAQ (SEQ ID NO 268)

[0235] Five type-specific variable regions (V1 to V5) can be identifiedafter aligning E1 amino acid sequences of the 4 genotypes, as shown inFIG. 5.

[0236] Region V1 encompasses amino acids 192 to 203, this is theamino-terminal 10 amino acids of the E1 protein. The following uniquesequences as shown in FIG. 5 can be deduced: LEWRNTSGLYVL (SEQ ID NO 83)VNYRNASGIYHI (SEQ ID NO 126) QHYRNISGIYHV (SEQ ID NO 127) EHYRNASGIYHI(SEQ ID NO 128) IHYRNASGIYHI (SEQ ID NO 224) VPYRNASGIYHV (SEQ ID NO 84)VNYRNASGIYHI (SEQ ID NO 225) VNYRNASGVYHI (SEQ ID NO 226) VNYHNTSGIYHL(SEQ ID NO 227) QHYRNASGIYHV (SEQ ID NO 228) QHYRNVSGIYHV (SEQ ID NO229) IHYRNASDGYYI (SEQ ID NO 230) LQVKNTSSSYMV (SEQ ID NO 231)

[0237] Region V2 encompasses amino acids 213 to 223. The followingunique sequences can be found in the V2 region as shown in FIG. 5:VYEADDVILHT (SEQ ID NO 85) VYETEHHILHL (SEQ ID NO 129) VYEADHHIMHL (SEQID NO 130) VYETDHHILHL (SEQ ID NO 131) VYEADNLILHA (SEQ ID NO 86)VWQLRAIVLHV (SEQ ID NO 232) VYEADYHILHL (SEQ ID NO 233) VYETDNHILHL (SEQID NO 234) VYETENHILHL (SEQ ID NO 235) VFETVHHILHL (SEQ ID NO 236)VFETEHHILHL (SEQ ID NO 237) VFETDHHIMHL (SEQ ID NO 238) VYETENHILHL (SEQID NO 239) VYEADALILHA (SEQ ID NO 240)

[0238] Region V3 encompasses the amino acids 230 to 242. The followingunique V3 region sequences can be deduced from FIG. 5: VQDGNTSTCWTPV(SEQ ID NO 87) VQDGNTSACWTPV (SEQ ID NO 241) VRVGNQSRCWVAL (SEQ ID NO132) VRTGNTSRCWVPL (SEQ ID NO 133) VRAGNVSRCWTPV (SEQ ID NO 134)EEKGNISRCWIPV (SEQ ID NO 242) VKTGNQSRCWVAL (SEQ ID NO 243)VRTGNQSRCWVAL (SEQ ID NO 244) VKTGNQSRCWIAL (SEQ ID NO 245)VKTGNVSRCWIPL (SEQ ID NO 247) VKTGNVSRCWISL (SEQ ID NO 248)VRKDNVSRCWVQI (SEQ ID NO 249)

[0239] Region 4 encompasses the amino acids 248 to 257. The followingunique V4 region sequences can be deduced from FIG. 5: VRYVGATTAS (SEQID NO 89) APYIGAPLES (SEQ ID NO 135) APYVGAPLES (SEQ ID NO 136)AVSMDAPLES (SEQ ID NO 137) APSLGAVTAP (SEQ ID NO 90) APSFGAVTAP (SEQ IDNO 250) VSQPGALTKG (SEQ ID NO 251) VKYVGATTAS (SEQ ID NO 252) APYIGAPVES(SEQ ID NO 253) AQHLNAPLES (SEQ ID NO 254) SPYVGAPLEP (SEQ ID NO 255)SPYAGAPLEP (SEQ ID NO 256) APYLGAPLEP (SEQ ID NO 257) APYLGAPLES (SEQ IDNO 258) APYVGAPLES (SEQ ID NO 259) VPYLGAPLTS (SEQ ID NO 260) APHLRAPLSS(SEQ ID NO 261) APYLGAPLTS (SEQ ID NO 262)

[0240] Region V5 encompasses the amino acids 294 to 303. The followingunique V5 region peptides can be deduced from FIG. 5: RPRRHQTVQT (SEQ IDNO 91) QPRRHWTTQD (SEQ ID NO 138) RPRRHWTTQD (SEQ ID NO 139) RPRQHATVQN(SEQ ID NO 92) RPRQHATVQD (SEQ ID NO 263) SPQHHKFVQD (SEQ ID NO 264)RPRRLWTTQE (SEQ ID NO 265) PPRIHETTQD (SEQ ID NO 266)

[0241] The variable region in the E2 region (HVR-2) of type 5 a as shownin FIG. 12 spanning amino acid positions 471 to 484 is also a preferredpeptide according to the present invention with the following sequence:TISYANGSGPSDDK (SEQ ID NO 267)

[0242] The above given list of peptides are particularly suitable forvaccine and diagnostic development.

[0243] Also comprised in the present invention is any synthetic peptideor polypeptide containing at least 5 contiguous amino acids derived fromthe above-defined peptides in their peptidic chain.

[0244] According to a specific embodiment, the present invention relatesto a composition as defined above, wherein said contiguous sequence isselected from any of the following HCV amino acid type 3 sequences:

[0245] a sequence having a homology of more than 72%, preferably morethan 74%, more preferably more than 77% and most preferably more than 80or 84% homology to any of the amino acid sequences as represented in SEQID NO 14, 16, 18, 20, 22, 24, 26 or 28 (HD10, BR36, BR33 sequences) inthe region spanning positions 140 to 319 in the Core/E1 region as shownin FIG. 5;

[0246] a sequence having a homology of more than 70%, preferably morethan 72%, more preferably more than 75% homology, most preferably morethan 81% homology to any of the amino acid sequences as represented inSEQ ID NO 14, 16, 18, 20, 22, 24, 26 or 28 (HD10, BR36, BR33 sequences)in the E1 region spanning positions 192 to 319 as shown in FIG. 5;

[0247] a sequence having a homology of more than 86%, preferably morethan 88%, and most preferably more than 90% homology to the amino acidsequences as represented in SEQ ID NO 148 (type 3 c); BE98 in the regionspanning positions 1 to 110 in the Core region as shown in FIG. 5;

[0248] a sequence having a homology of more than 76%, preferably morethan 78%, most preferably more than 80% to any of the amino acidsequences as represented in SEQ ID NO 30, 32, 34, 36, 38 or 40 (HCC153,HD10, BR36 sequences) in the region spanning positions 1646 to 1764 inthe NS3/NS4 region as shown in FIG. 7 and 11;

[0249] a sequence having a homology of more than 81%, preferably morethan 83%, and most preferably more than 86% homology to any of the aminoacid sequences as represented in SEQ ID NO 14, 16, 18, 20, 22, 24, 26 or28 (HD10, BR36, BR33 sequences) in the region spanning positions 140 to319 in the Core/E1 region as shown in FIG. 5;

[0250] a sequence having a homology of more than 81.5%, preferably morethan 83%, and most preferably more than 86% homology to any of the aminoacid sequences as represented in SEQ ID NO 14, 16, 18, 20, 22, 24, 26 or28 (HD10, BR36, BR33 sequences) in the E1 region spanning positions 192to 319 as shown in FIG. 5;

[0251] a sequence having a homology of more than 86%, preferably morethan 88%, most preferably more than 90% to the amino acid sequence asrepresented in SEQ ID NO 150; (type 3 c BE98) in the region spanningpositions 2645 to 2757 in the NS5B region as shown in FIG. 2.

[0252] According to yet another embodiment, the present inventionrelates to a composition as defined above, wherein said contiguoussequence is selected from any of the following HCV amino acid type 4sequences:

[0253] a sequence having a homology of more than 80%, preferably morethan 82%, most preferably more than 84% homology to any of the aminoacid sequences as represented in SEQ ID NO 118, 120, and 122 (GB358,GB549, GB809 sequences) in the region spanning positions 127 to 319 ofthe Core/E1 region as shown in FIG. 5;

[0254] a sequence having a homology of more than 73%, preferably morethan 75%, most preferably more than 78% homology in the E1 regionspanning positions 192 to 319 to any of the amino acid sequences asrepresented in SEQ ID NO 118, 120, and 122 (GB358, GB549, GB809sequences) in the region spanning positions 140 to 319 of the Core/E1region as shown in FIG. 5;

[0255] a sequence having more than 85%, preferably more than 86%, mostpreferably more than 87% homology to any of the amino acid sequences asrepresented in SEQ ID NO 118, 120 or 122 (GB358, GB549, GB809 sequences)in the region spanning positions 192 to 319 of E1 as shown in FIG. 5;

[0256] a sequence showing more than 73%, preferably more than 74%, mostpreferably more than 75% homology to any of the amino acid sequences asrepresented in SEQ ID NO 106, 108, 110, 112, 114 or 116 (GB48, GB116,GB215, GB358, GB549, GB809 sequences) in the region spanning positions2645 to 2757 of the NS5B region as shown in FIG. 2;

[0257] a sequence having any of the sequences as represented in SEQ IDNO 164 or 166 (GB809 and CAM600 sequences) in the Core/E1 region asshown in FIG. 5;

[0258] a sequence having any of the sequences as represented in SEQ IDNO 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188 or 190 (CAM600,GB809, CAMG22, CAMG27, GB549, GB438, CAR4/1205, CAR4/901, GB116, GB215,GB958, GB809-4 sequences) in the E1 region as shown in FIG. 5;

[0259] a sequence having any of the sequences as represented in SEQ IDNO 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212 (GB358, GB724,BE100, PC, CAM600, CAMG22, etc.) in the NS5B region.

[0260] The above-mentioned type 4 peptides polypeptides comprise atleast an amino acid sequence selected from any HCV type 4 polyproteinwith the exception of core sequence as disclosed by Simmonds et al.(1993, EG-29, see FIG. 5).

[0261] According to yet another aspect, the present invention relates toa composition as defined above, wherein said contiguous sequence isselected from any of the following HCV amino acid type 5 sequences:

[0262] a sequence having more than 93%, preferably more than 94%, mostpreferably more than 95% homology in the region spanning Core positions1 to 191 to any of the amino acid sequences as represented in SEQ ID NO42, 44, 46, 48, 50, 52 or 54 (PC sequences) and SEQ ID NO 152 (BE95) asshown in FIG. 5;

[0263] a sequence having more than 73%, preferably more than 74%, mostpreferably more than 76% homology in the region spanning E1 positions192 to 319 to any of the amino acid sequences as represented in SEQ IDNO 42, 44, 46, 48, 50, 52 or 54 (PC sequences) as shown in FIG. 5

[0264] a sequence having a more than 78%, preferably more than 80%, mostpreferably more than 83% homology to any of the amino acid sequences asrepresented in SEQ ID NO 42, 44, 46, 48, 50, 52, 54, 154, 156 (BE95,BE100) (PC sequences) in the region spanning positions 1 to 319 of theCore/E1 region as shown in FIG. 5;

[0265] a sequence having more than 90%, preferably more than 91%, mostpreferably more than 92% homology to any of the amino acid sequencesrepresented in SEQ ID NO 56 to 58 (PC sequences) in the region spanningpositions 1286 to 1403 of the NS3 region as shown in FIG. 7 or 11;

[0266] a sequence having more than 66%, more particularly 68%, mostparticularly 70% or more homology to any of the amino acid sequences asrepresented in SEQ ID NO 60 or 62 (PC sequences) in the region spanningpositions 1646 to 1764 of the NS3/4 region as shown in FIG. 7 or 11.

[0267] According to yet another embodiment, the present inventionrelates to a composition as defined above, wherein said contiguoussequence is selected from any of the following HCV amino acid type 2 dsequences:

[0268] a sequence having more than 83%, preferably more than 85%, mostpreferably more than 87% homology to the amino acid sequence asrepresented in SEQ ID NO 144 (NE92) in the region spanning positions 1to 319 of the Core/E1 region as shown in FIG. 5;

[0269] a sequence having more than 79%, preferably more than 81%, mostpreferably more than 84% homology in the region spanning E1 positions192 to 319 to the amino acid sequence as represented in SEQ ID NO 144(NE92) as shown in FIG. 12;

[0270] a sequence having more than 95%, more particularly 96%, mostparticularly 97% or more homology to the amino acid sequence asrepresented in SEQ ID NO 146 (NE92) in the region spanning positions2645 to 2757 of the NS5B region as shown in FIG. 2.

[0271] The present invention also relates to a recombinant vector,particularly for cloning and/or expression, with said recombinant vectorcomprising a vector sequence, an appropriate prokaryotic, eukaryotic orviral promoter sequence followed by the nucleotide sequences as definedabove, with said recombinant vector allowing the expression of any oneof the HCV type 2 and/or HCV type 3 and/or type 4 and/or type 5 derivedpolypeptides as defined above in a prokaryotic, or eukaryotic host or inliving mammals when injected as naked DNA, and more particularly arecombinant vector allowing the expression of any of the following HCVtype 2 d, type 3, type 4 or type 5 polypeptides spanning the followingamino acid positions:

[0272] a polypeptide starting at position 1 and ending at any positionin the region between positions 70 and 326, more particularly apolypeptide spanning positions 1 to 70, 1 to 85, positions 1 to 120,positions 1 to 150, positions 1 to 191, positions 1 to 200, forexpression of the Core protein, and a polypeptide spanning positions 1to 263, positions 1 to 326, for expression of the Core and E1 protein;

[0273] a polypeptide starting at any position in the region betweenpositions 117 and 192, and ending at any position in the region betweenpositions 263 and 326, for expression of E1, or forms that have theputative membrane anchor deleted (positions 264 to 293 plus or minus 8amino acids);

[0274] a polypeptide starting at any position in the region betweenpositions 1556 and 1688, and ending at any position in the regionbetween positions 1739 and 1764, for expression of the NS4 regions, moreparticularly a polypeptide starting at position 1658 and ending atposition 1711 for expression of the NS4a antigen, and more particularly,a polypeptide starting at position 1712 and ending between positions1743 and 1972, for example 1712-1743, 1712-1764, 1712-1782, 1712-1972,1712 to 1782 and 1902 to 1972 for expression of the NS4b protein orparts thereof.

[0275] The term “vector” may comprise a plasmid, a cosmid, a phage, or avirus.

[0276] In order to carry out the expression of the polypeptides of theinvention in bacteria such as E. coli or in eukaryotic cells such as inS. cerevisiae, or in cultured vertebrate or invertebrate hosts such asinsect cells, Chinese Hamster Ovary (CHO), COS, BHK, and MDCK cells, thefollowing steps are carried out:

[0277] transformation of an appropriate cellular host with a recombinantvector, in which a nucleotide sequence coding for one of thepolypeptides of the invention has been inserted under the control of theappropriate regulatory elements, particularly a promoter recognized bythe polymerases of the cellular host and, in the case of a prokaryotichost, an appropriate ribosome binding site (RBS), enabling theexpression in said cellular host of said nucleotide sequence. In thecase of an eukaryotic host any artificial signal sequence or pre/prosequence might be provided, or the natural HCV signal sequence might beemployed, e.g. for expression of E1 the signal sequence starting betweenamino acid positions 117 and 170 and ending at amino acid position 191can be used, for expression of NS4, the signal sequence starting betweenamino acid positions 1646 and 1659 can be used, culture of saidtransformed cellular host under conditions enabling the expression ofsaid insert.

[0278] The present invention also relates to a composition as definedabove, wherein said polypeptide is a recombinant polypeptide expressedby means of an expression vector as defined above. The present inventionalso relates to a composition as defined above, for use in a method forimmunizing a mammal, preferably humans, against HCV comprisingadministring a sufficient amount of the composition possibly accompaniedby pharmaceutically acceptable adjuvants, to produce an immune response,more particularly a vaccine composition including HCV type 3polypeptides derived from the Core, E1 or the NS4 region and/or HCV type4 and/or HCV type 5 polypeptides and/or HCV type 2 d polypeptides.

[0279] The present invention also relates to an antibody raised uponimmunization with a composition as defined above by means of a processas defined above, with said antibody being reactive with any of thepolypeptides as defined above, and with said antibody being preferably amonoclonal antibody.

[0280] The monoclonal antibodies of the invention can be produced by anyhybridoma liable to be formed according to classical methods fromsplenic cells of an aniunal, particularly from a mouse or rat, immunizedagainst the HCV polypeptides according to the invention, or muteinsthereof, or fragments thereof as defined above on the one hand, and ofcells of a myeloma cell line on the other hand, and to be selected bythe ability of the hybridoma to produce the monoclonal antibodiesrecognizing the polypeptides which has been initially used for theimmunization of the animals.

[0281] The antibodies involved in the invention can be labelled by anappropriate label of the enzymatic, fluorescent, or radioactive type.

[0282] The monoclonal antibodies according to this preferred embodimentof the invention may be humanized versions of mouse monoclonalantibodies made by means of recombinant DNA technology, departing fromparts of mouse and/or human genomic DNA sequences coding for H and Lchains or from cDNA clones coding for H and L chains.

[0283] Alternatively the monoclonal antibodies according to thispreferred embodiment of the invention may be human monoclonalantibodies. These antibodies according to the present embodiment of theinvention can also be derived from human peripheral blood lymphocytes ofpatients infected with type 3, type 4 or type 5 HCV, or vaccinatedagainst HCV. Such human monoclonal antibodies are prepared, forinstance, by means of human peripheral blood lymphocytes (PBL)repopulation of severe combined immune deficiency (SCID) mice (forrecent review, see Duchosal et al. 1992).

[0284] The invention also relates to the use of the proteins of theinvention, muteins thereof, or peptides derived therefrom for theselection of recombinant antibodies by the process of repertoire cloning(Persson et al., 1991).

[0285] Antibodies directed to peptides derived from a certaing genotypemay be used either for the detection of such HCV genotypes, or astherapeutic agents.

[0286] The present invention also relates to the use of a composition asdefined above for incorporation into an immunoassay for detecting HCV,present in biological sample liable to contain it, comprising at leastthe following steps:

[0287] (i) contacting the biological sample to be analyzed for thepresence of HCV antibodies with any of the compositions as defined abovepreferably in an immobilized form under appropriate conditions whichallow the formation of an immune complex, wherein said polypeptide canbe a biotinylated polypeptide which is covalently bound to a solidsubstrate by means of streptavidin or avidin complexes,

[0288] (ii) removing unbound components,

[0289] (iii) incubating the immune complexes formed with heterologousantibodies, which specifically bind to the antibodies present in thesample to be analyzed, with said heterologous antibodies havingconjugated to a detectable label under appropriate conditions,

[0290] (iv) detecting the presence of said immunecomplexes visually orby means of densitometry and inferring the HCV serotype present from theobserved hybridization pattern.

[0291] The present invention also relates to the use of a composition asdefined above, for incorporation into a serotyping assay for detectingone or more serological types of HCV present in a biological sampleliable to contain it, more particularly for detecting E1 and NS4antigens or antibodies of the different types to be detected combined inone assay format, comprising at least the following steps:

[0292] (i) contacting the biological sample to be analyzed for thepresence of HCV antibodies or antigens of one or more serological types,with at least one of the compositions as defined above, an immobilizedform under appropriate conditions which allow the formation of animmunecomplex,

[0293] (ii) removing unbound components,

[0294] (iii) incubating the immunecomplexes formed with heterologousantibodies, which specifically bind to the antibodies present in thesample to be analyzed, with said heterologous antibodies havingconjugated to a detectable label under appropriate conditions,

[0295] (iv) detecting the presence of said immunecomplexes visually orby means of densitometry and inferring the presence of one or more HCVserological types present from the observed binding pattern.

[0296] The present invention also relates to the use of a composition asdefined above, for immobilization on a solid substrate and incorporationinto a reversed phase hybridization assay, preferably for immobilizationas parallel lines onto a solid support such as a membrane strip, fordetermining the presence or the genotype of HCV according to a method asdefined above.

[0297] The present invention thus also relates to a kit for determiningthe presence of HCV genotypes as defined above present in a biologicalsample liable to contain them, comprising:

[0298] possibly at least one primer composition containing any primerselected from those defined above or any other HCV type 3 and/or HCVtype 4, and/or HCV type 5, or universal HCV primers,

[0299] at least one probe composition as defined above, with said probesbeing preferentially immobilized on a solid substrate, and morepreferentially on one and the same membrane strip,

[0300] a buffer or components necessary for producing the bufferenabling hybridization reaction between these probes and the possiblyamplified products to be carried out,

[0301] means for detecting the hybrids resulting from the precedinghybriziation,

[0302] possibly also including an automated scanning and interpretationdevice for inferring the HCV genotypes present in the sample from theobserved hybridization pattern.

[0303] The genotype may also be detected by means of a type-specificantibody as defined above, which is linked to any polynucleotidesequence that can afterwards be amplified by PCR to detect the immunecomplex formed (Immuno-PCR, Sano et al., 1992);

[0304] The present invention also relates to a kit for determining thepresence of HCV antibodies as defined above present in a biologicalsample liable to contain them, comprising:

[0305] at least one polypeptide composition as defined above,preferentially in combination with other polypeptides or peptides fromHCV type 1, HCV type 2 or other types of HCV, with said polypeptidesbeing preferentially immobilized on a solid substrate, and morepreferentially on one and the same membrane strip,

[0306] a buffer or components necessary for producing the bufferenabling binding reaction between these polypeptides and the antibodiesagainst HCV present in the biological sample,

[0307] means for detecting the immunecomplexes formed in the precedingbinding reaction,

[0308] possibly also including an automated scanning and interpretationdevice for inferring the HCV genotypes present in the sample from theobserved binding pattern.

[0309] Figure Legends

[0310]FIG. 1

[0311] Alignment of consensus nucleotide sequences for each of the type3 a isolates BR34, BR36, and BR33, deduced from the clones with SEQ IDNO 1, 5, 9; type 4 isolates GB48, GB116, GB215, GB358, GB549, GB809,CAM600, CAMG22, GB438, CAR4/1205, CAR1/501 (SEQ ID NO. 106, 108, 110,112, 114, 116, 201, 203, 205, 207, 209 and 211); type 5 a isolates BE95and BE96 (SEQ ID NO 159 and 161) and type 2 d isolate NE92 (SEQ ID NO145) from the region between nucleotides 7932 and 8271, with knownsequences from the corresponding region of isolates HCV-1, HCV-J, HC-J6,HC-J8, T1 and T9, and others as shown in Table 3.

[0312]FIG. 2

[0313] Alignment of amino acids sequences deduced from the nucleic acidsequences as represented in FIG. 1 from the subtype 3 a clones BR34 (SEQID NO 2, 4), BR36 (SEQ ID NO 6, 8) and BR33 (SEQ ID NO 10, 12), thesubtype 3 c clone BE98 (SEQ ID NO 150), and the type 4 clones GB48 (SEQID NO 107), GB116 (SEQ ID NO 109), GB215 (SEQ ID NO 111), GB358 (SEQ IDNO 113), GB549 (SEQ ID NO 115) GB809 (SEQ ID NO 117); CAM600, CAMG22,GB438, CAR4/1205, CAR1/501 (SEQ ID NO 202, 204, 206, 208, 210, 212); thetype 5 a clones BE95 and BE96 (SEQ ID NO 160 and 162); as well as thesubtype 2 d isolate NE92 (SEQ ID NO 146) from the region between aminoacids 2645 to 2757 with known sequences from the corresponding region ofisolates HCV-I, HCV-J, HC-J6, and HC-J8, T1 and T9, and other sequencesas shown in Table 3.

[0314]FIG. 3

[0315] Aligment of type 2 d, 3 c, 4 and 5 a nucleotide sequences fromisolates NE92, BE98, GB358, GB809, CAM600, GB724, BE95 (SEQ ID NO 143,147, 191, 163, 165, 193 and 151) in the Core region between nucleotidepositions 1 and 500, with known sequences from the corresponding regionof type 1, type 2, type 3 and type 4 sequences.

[0316]FIG. 4

[0317] Alignment of nucleotide sequences for the subtype 2 d isolateNE92 (SEQ ID NO 143), the type 4 isolates GB358 (SEQ ID NO 118 and 187),GB549 (SEQ ID NO 120 and 175), and GB809-2 (SEQ ID NO 122 and 169), GB809-4, BG116, GB215, CAM600, CAMG22, CAMG27, GB438, CAR4/1205, CAR4/901(SEQ ID NO 189, 183, 185, 167, 171, 173, 177, 179, 181), sequences foreach of the subtype 3 a isolates HD10, BR36, and BR33, (SEQ ID NO 13,15, 17 (HD10), 19, 21 (BR36) and 23, 25 or 27 (BR23) and the subtype 5 aisolates BE95 and BE100 (SEQ ID NO 143 and 195) from the region betweennucleotides 379 and 957, with known sequences from the correspondingregion of type 1 and 2 and 3.

[0318]FIG. 5

[0319] Alignment of amino acid sequences deduced from the new HCVnucleotide sequences of the Core/E1 region of isolates BR33, BR36, HD10,GB358, GB549, and GB809, PC or BE95, CAM600, and GB724 (SEQ ID NO. 14,20, 24, 119 or 192, 121, 123 or 164, 54 or 152, 166 and 194) from theregion between positions 1 and 319, with known sequences from type 1 a(HCV-1), type 1 b (HCV-J), type 2 a (HC-JG), type 2 b (HC-J8), NZL1,HCV-TR, positions 7-89 of type 3 a (E-b1), and positions 8-88 of type 4a (EG-29). V-Core, variable region with type-specific features in thecore protein, V1, variable region 1 of the E1 protein, V2, variableregion 2 of the E1 protein, V3, variable region 3 of the E1 protein, V4,variable region 4 of the E1 protein, V5, variable region 5 of the E1protein.

[0320]FIG. 6

[0321] Alignment of nucleotide sequences of isolates HCCL53, HD10 andBR36, deduced from clones with SEQ ID NO 29, 31, 33, 35, 37 and 39, fromthe NS3/4 region between nucleotides 4664 to 5292, with known sequencesfrom the corresponding region of isolates HCV-1, HCV-J, HC-J6, andHC-J8, EB1, EB2, EB6 and EB7.

[0322]FIG. 7

[0323] Alignment of amino acid sequences deduced from the new HCVnucleotide sequences of the NS3/NS4 region of isolate BR36 (SEQ ID NO36, 38 and 40) and BE95 (SEQ ID NO 270). NS4-1, indicates the regionthat was synthesized as synthetic peptide 1 of the NS4 region, NS4-5,indicates the region that was synthesized as synthetic peptide 5 of theNS4 region; NS4-7, indicates the region that was synthesized assynthetic peptide 7 of the NS4 region.

[0324]FIG. 8

[0325] Reactivity of the three LIPA-selected (Stuyver et al., 1993) type3 sera on the Inno-LIA HCV Ab II assay (Innogenetics) (left), and on theNS4-LIA test. For the NS4-LIA test, NS4-1, NS4-5, and NS4-7 peptideswere synthesized based on the type 1 (HCV-1), type 2 (HC-J6) and type 3(BR36) prototype isolate sequences as shown in Table 4, and applied asparallel lines onto a membrane strip as indicated. 1, serum BR33, 2,serum HD10, 3, serum DKH.

[0326]FIG. 9

[0327] Nucleotide sequences of Core/E1 clones obtained from the PCRfragments PC-2, PC-3, and PC-4, obtained from serum BE95 (PC-2-1 (SEQ IDNO 41), PC-2-6 (SEQ ID NO 43), PC-4-1 (SEQ ID NO 45), PC-4-6 (SEQ ID NO47), PC-34 (SEQ ID NO 49), and PC-3-8 (SEQ ID NO 51)) of subtype 5 aisolate BE95.

[0328] A consensus sequence is shown for the Core and E1 region ofisolate BE95, presented as PC C/E1 with SEQ ID NO 53. Y, C or T, R, A orG, S, C or G.

[0329]FIG. 10

[0330] Alignment of nucleotide sequences of clones with SEQ ID NO 197and 199 (PC sequences, see also SEQ ID NO 55, 57, 59) and SEQ ID NO 35,37 and 39 (BR36 sequences) from the NS3/4 region between nucleotides3856 to 5292, with known sequences from the corresponding region ofisolates HCV-1, HCV-J, HC-J6, and HC-J8.

[0331]FIG. 11

[0332] Alignment of amino acid sequences of subtype 5 a BE95 isolate PCclones with SEQ ID NO 56 and 58, from the NS3/4 region between aminoacids 1286 to 1764, with known sequences from the corresponding regionof isolates HCV-1, HCV-J, HC-J6, and HC-J8.

[0333]FIG. 12

[0334] Aligment of amino acid sequences of subtype 5 a isolate BE95 (SEQID NO 158) in the E1/E2 region spanning positions 328 to 546, with knownsequnces from the corresponding region of isolates HCV-1, HCV-J, HC-J6,HC-J8, NZL1 and HCV-TR (see Table 3).

[0335]FIG. 13

[0336] Alignment of the nucleotide sequences of subtype 5 a isolate BE95(SEQ ID NO 157) in the E1/E2 region with known HCV sequences as shown inTable 3.

EXAMPLES Example 1 The NS5b region of HCV type 3

[0337] Type 3 sera, selected by means of the INNO-LiPA HCV research kit(Stuyver et al., 1993) from a number of Brazilian blood donors, werepositive in the HCV antibody ELISA (Innotest HCV Ab II; Innogenetics)and/or in the INNO-LIA HCV Ab II confirmation test (Innogenetics). Onlythose sera that were positive after the first round of PCR reactions(Stuyver et al., 1993) were retained for further study.

[0338] Reverse transcription and nested PCR: RNA was extracted from 50μl serum and subjected to cDNA synthesis as described (Stuyver et al.,1993). This cDNA was used as template for PCR, for which the totalvolume was increased to 50 μl containing 10 pmoles of each primer, 3 μlof 10×Pfu buffer 2 (Stratagene) and 2.5 U of Pfu DNA polymerase(Stratagene). The cDNA was amplified over 45 cycles consisting of 1 min94° C., 1 min 50° C. and 2 min 72° C. The amplified products wereseparated by electrophoresis, isolated, cloned and sequenced asdescribed (Stuyver et al., 1993).

[0339] Type 3 a and 3 b-specific primers in the NS5 region were selectedfrom the published sequences (Morn et al., 1992) as follows:

[0340] for type 3 a: HCPr161(+): 5′-ACCGGAGGCCAGGAGAGTGATCTCCTCC-3′ (SEQID NO 63) and HCPr162(−): 5′-GGGCTGCTCTATCCTCATCGACGCCATC-3′ (SEQ ID NO64);

[0341] for type 3 b: HCPr163(+): 5′-GCCAGAGGCTCGGAAGGCGATCAGCGCT-3′ (SEQID NO 65) and HCPr164(−): 5′-GAGCTGCTCTGTCCTCCTCGACGCCGCA-3′ (SEQ ID NO66)

[0342] Using the Line Probe Assay (LiPA) (Stuyver et al., 1993), sevenhigh-titer type 3 sera were selected and subsequently analyzed with theprimer sets HCPr161/162 for type 3 a, and HCPr163/164 for type 3 b. Noneof these sera was positive with the type 3 b primers. NS5 PCR fragmentsobtained using the type 3 a primers from serum BR36 (BR36-23), serumBR33 (BR33-2) and serum BR34 (BR34-4) were selected for cloning. Thefollowing sequences were obtained from the PCR fragments

[0343] From fragment BR34-4:

[0344] BR34-4-20 (SEQ ID NO 1), BR34-4-19 (SEQ ID NO 3)

[0345] From fragment BR36-23:

[0346] BR36-23-18 (SEQ ID NO 5), BR36-23-20 (SEQ ID NO 7)

[0347] From fragment BR33-2:

[0348] BR33-2-17 (SEQ ID NO 9), BR33-2-21 (SEQ ID NO 11)

[0349] An alignment of sequences with SEQ ID NO 1, 5 and 9 with knownsequences is given in FIG. 1. An alignment of the deduced amino acidsequences is shown in FIG. 2. The 3 isolates are very closely related toeach other (mutual homologies of about 95%) and to the publishedsequences of type 3 a (Mori et al., 1992), but are only distantlyrelated to type 1 and type 2 sequences (Table 5). Therefore, it isclearly demonstrated that NS5 sequences from LiPA-selected type 3 seraare indeed derived from a type 3 genome. Moreover, by analyzing the NS5region of serum BR34, for which no 5′UR sequences were determined asdescribed in Stuyver et al. (1993), the excellent correlation betweentyping by means of the LiPA and genotyping as deduced from nucleotidesequencing was further proven.

Example 2 The Core/E1 region of HCV type 3

[0350] After aligning the sequences of HCV-1 (Choo et al., 1991), HCV-J(Kato et al., 1990), HC-J6 (Okamoto et al., 1991), and HC-J8 (Okamoto etal., 1992), PCR primers were chosen in those regions of little sequencevariation. Primers HCPr23(+): 5′-CTCATGGGGTACATTCCGCT-3′ (SEQ ID NO 67)and HCPr54(−): 5′-TATTACCAGTTCATCATCATATCCCA-3′ (SEQ ID NO 68), weresynthesized on a 392 DNA/RNA synthesizer (Applied Biosystems). This setof primers was selected to amplify the sequence from nucleotide 397 to957 encoding amino acids 140 to 319 (Kato et al., 1990): 52 amino acidsfrom the carboxyterminus of core and 128 amino acids of E1 (Kato et al.,1990). The amplification products BR36-9, BRR33-1, and HD10-2 werecloned as described (Stuyver et al., 1993). The following clones wereobtained from the PCR fragments:

[0351] From fragment HD10-2:

[0352] HD10-2-5 (SEQ ID NO 13), HD10-2-14 (SEQ ID NO 15), HD10-2-21 (SEQID NO 17)

[0353] From fragment BR36-9:

[0354] BR36-9-13 (SEQ ID NO 19), BR36-9-20 (SEQ ID NO 21),

[0355] From fragment BR33-1:

[0356] BR33-1-10 (SEQ ID NO 23), BR33-1-19 (SEQ ID NO 25), BR33-1-20(SEQ ID NO 27),

[0357] An alignment of the type 3 E1 nucleotide sequences (HD10, BR36,BR33) with SEQ ID NO 13, 19 and 23 with known E1 sequences is presentedin FIG. 4. Four variations were detected in the E1 clones from serumHD10 and BR36, while only 2 were found in BR33. All are silent thirdletter variations, with the exception of mutations at position 40 (L toP) and 125 (M to I). The homologies of the type 3 E1 region (withoutcore) with type 1 and 2 prototype sequences are depicted in Table 5.

[0358] In total, 8 clones covering the core/E1 region of 3 differentisolates were sequenced and the E1 portion was compared with the knowngenotypes (Table 3) as shown in FIG. 5. After computer analysis of thededuced amino acid sequence, a signal-anchor sequence at the corecarboxyterminus was detected which might, through analogy with type 1 b(Hijikata et al., 1991), promote cleavage before the LEWRN sequence(position 192, FIG. 5). The L-to-P mutation in one of the HD10-2 clonesresides in this signal-anchor region and potentially impairs recognitionby signal peptidase (computer prediction). Since no examples of suchsubstitutions were found at this position in previously describedsequences, this mutation might have resulted from reverse transcriptaseor Pu polymerase misincorporation. The 4 amino-terminal potentialN-linked glycosylation sites, which are also present in HCV types 1 aand 2, remain conserved in type 3. The N-glycosylation site in type 1 b(aa 250, Kato et al., 1990) remains a unique feature of this subtype.All E1 cysteines, and the putative transmembrane region (aa 264 to 293,computer prediction) containing the aspartic acid at position 279, areconserved in all three HCV types. The following hypervariable regionscan be delineated: V1 from aa 192 to 203 (numbering according to Kato etal., 1990), V2 (213-223), V3 (230-242), V4 (248-257), and V5 (294-303).Such hydrophilic regions are thought to be exposed to the host defensemechanisms. This variability might therefore have been induced by thehost's immune response. Additional putative N-linked glycosylation sitesin the V4 region in all type 1 b isolates known today and in the V5region of HC-J8 (type 2 b) possibly further contribute to modulation ofthe immune response. Therefore, analysis of this region, in the presentinvention, for type 3 and 4 sequences has been instrumental in thedelineation of epitopes that reside in the V-regions of E1, which willbe critical for future vaccine and diagnostics development.

Example 3 The NS3/NS4 region of HCV Type 3

[0359] For the NS3/NS4 border region, the folllowing sets of primerswere selected in the regions of little sequence variability afteraligning the sequences of HCV-1 (Choo et al., 1991), HCV-J (Kato et al.,1990), HC-J6 (Okamoto et al., 1991), and HC-J8 (Okamoto et al., 1992)(smaller case lettering is used for nucleotides added for cloningpurposes):

[0360] set A: HCPr116(+): 5′-ttttAAATACATCATGRCITGYATG-3′ (SEQ ID NO 69)

[0361] HCPr66 (−): 5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO70) set B: HCPr116(+): 5′-ttttAAATACATCATGRCITGYATG-3′ (SEQ ID NO 69)HCPr118(−): 5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO 71)set C: HCPr117(+): 5′-ttttAAATACATCGCIRCITGCATGCA-3′ (SEQ ID NO 72)HCPr66 (−): 5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO 70) setD: HCPr117(+): 5′-ttttAAATACATCGCIRCITGCATGCA-3′ (SEQ ID NO 72)HCPr118(−): 5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO 71)set E: HCPr116(+): 5′-ttttAAATACATCATGRCITGYATG-3′ (SEQ ID NO 69)HCPr119(−): actagtcgactaRTTIGCIATIAGCCG/TRTTCATCCAYTG-3′ (SEQ ID NO 73)set F: HCPr117(+): 5′-ttttAAATACATCGCIRCITGCATGCA-3′ (SEQ ID NO 72)HCPr119(−): actagtcgactaRTTIGCIATIAGCCG/TRTTCATCCAYTG-3′ (SEQ ID NO 73)set G: HCPr131(+): 5′-ggaattctagaCCITCITGGGAYGARAYITGGAARTG-3′ (SEQ IDNO 74) HCPr66 (−): 5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO70) set H: HCPr130(+): 5′-ggaattctagACIGCITAYCARGCIACIGTITGYGC-3′ (SEQID NO 75) HCPr66 (−): 5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO70) set I: HCPr134(+): 5′-CATATAGATGCCCACTTCCTATC-3′ (SEQ ID NO 76)HCPr66 (−): 5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO 70) setJ: HCPr131(+): 5′-ggaattctagaCCITCITGGGAYGARAYITGGAARTG-3′ (SEQ ID NO74) HCPr118(−): 5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO71) set K: HCPr130(+): 5′-ggaattctagACIGCITAYCARGCIACIGTITGYGC-3′ (SEQID NO 75) HCPr118(−): 5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQID NO 71) set L: HCPr134(+): 5′-CATATAGATGCCCACTTCCTATC-3′ (SEQ ID NO76) HCPr118(−): 5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO71) set M: HCPr3(+): 5′-GTGTGCCAGGACCATC-3′ (SEQ ID NO 77) and HCPr4(−):5′-GACATGCATGTCATGATGTA-3 (SEQ ID NO 78) set N: HCPr3(+):5′-GTGTGCCAGGACCATC-3′ (SEQ ID NO 77) and HCPr118(−):5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO 71) set O:HCPr3(+): 5′-GTGTGCCAGGACCATC-3′ (SEQ ID NO 77) and HCPr66 (−):5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO 70)

[0362] No PCR products could be obtained with the sets of primers A, B,C, D, E, F, G, H, I, J, K, L, M, and N, on random-primed cDNA obtainedfrom type 3 sera. With the primer set O no fragment could be amplifiedfrom type 3 sera. However, a smear containing a few weakly stainablebands was obtained from serum BR36. After sequence analysis of severalDNA fragments, purified and cloned from the area around 300 bp on theagarose gel, only one clone, HCC153 (SEQ ID NO 29), was shown to containHCV information. This sequence was used to design primer HCPr152.

[0363] A new primer set P was subsequently tested on several sera.

[0364] set P: HCPr152(+): 5′-TACGCCTCTTCTATATCGGTTGGGGCCTG-3′ (SEQ ID NO79) and HCPr66(−): 5′-CTATTATTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO70)

[0365] The 464-bp HCPr152/66 fragment was obtained from serum BR36(BR36-20) and serum HD10 (HD10-1). The following clones were obtainedfrom these PCR products:

[0366] From fragment HD10-1:

[0367] HD10-1-25 (SEQ ID NO 31), HD10-1-3 (SEQ ID NO 33),

[0368] From fragment BR36-20:

[0369] BR36-20-164 (SEQ ID NO 35), BR36-20-165 (SEQ ID NO 37),BR36-20-166 (SEQ ID NO 39),

[0370] The nucleotide sequences obtained from clones with SEQ ID NO 29,31, 33, 35, 37 or 39 are shown aligned with the sequences of prototypeisolates of other types of HCV in FIG. 6. In addition to one silent 3rdletter variation, one 2nd letter mutation resulted in an E to Gsubstitution at position 175 of the deduced amino acid sequence of BR36(FIG. 7). Serum HD10 clones were completely identical. The two type 3isolates were nearly 94% homologous in this NS4 region. The homologieswith other types are presented in Table 5.

Example 4 Analysis of the anti-NS4 response to type-specific peptides

[0371] As the NS4 sequence contains the information for an importantepitope cluster, and since antibodies towards this region seem toexhibit little cross-reactivity (Chan et al., 1991), it was worthwhileto investigate the type-specific antibody response to this region. Foreach of the 3 genotypes, HCV-1 (Choo et al., 1991), HC-J6 (Okamoto etal., 1991) and BR36 (present invention), three 20-mer peptides weresynthesized covering the epitope region between amino acids 1688 and1743 (as depicted in table 6). The synthetic peptides were applied asparallel lines onto membrane strips. Detection of anti-NS4 antibodiesand color development was performed according to the procedure describedfor the INNO-LIA HCV Ab 11 kit (Innogenetics, Antwerp). Peptidesynthesis was carried out on a 9050 PepSynthesizer (Millipore). Afterincubation with 15 LiPA-selected type 3 sera, 9 samples showedreactivity towards NS4 peptides of at least 2 different types, but aclearly positive reaction was observed for 3 sera (serum BR33, HD30 andDKH) on the type 3 peptides, while negative (serum BR33 and HD30) orindeterminate (serum DKH) on the type 1 and type 2 NS4 peptides; 3 seratested negative for anti-NS4 antibodies (FIG. 8). Using the se membranestrips coated with the 9 peptides as indicated above and as shown inFIG. 8, 38 type 1 sera (10 type 1 a and 28 type 1 b), 11 type 2 sera (10type 2 a and 1 type 2 b), 12 type 3 a sera and 2 type 4 sera (asdetermined by the LiPA procedure) were also tested. As shown in Table 8,the sera reacted in a genotype-specific manner with the NS4 epitopes.These results demonstrate that type-specific anti-NS4 antibodies can bedetected in the sera of some patients. Such genotype-specific syntheticpeptides might be employed to develop serotyping assays, for example amixture of the nine peptides as indicated above, or combined with theNS4 peptides from the HCV type 4 or 6 genotype or from new genotypescorresponding to the region between amino acids 1688 and 1743, orsynthetic peptides of the NS4 region between amino acids 1688 and 1743of at least one of the 6 genotypes, combined with the E1 protein ordeletion mutants thereof, or synthetic E1 peptides of at least one ofthe genotypes. Such compositions could be further extended withtype-specific peptides or proteins, including for example the regionbetween amino acids 68 and 91 of the core protein, or more preferablythe region between amino acids 68 and 78. Furthermore, suchtype-specific antigens may be advantageously used to improve currentdiagnostic screening and confirmation assays and/or HCV vaccines.

Example 5 The Core and E1 regions of HCV type 5

[0372] Sample BE95 was selected from a group of sera that reactedpositive in a prototype Line Probe Assay as described earlier (Stuyveret al., 1993), because a high-titer of HCV RNA could be detected,enabling cloning of fragments by a single round of PCR. As no sequencesfrom any coding region of type 5 has been disclosed yet, syntheticoligonucleotides for PCR amplification were chosen in the regions oflittle sequence variation after aligning the sequences of HCV-1 (Choo etal., 1991), HCV-J (Kato et al., 1990), HC-J6 (Okamoto et al., 1991),HC-J8 (Okamoto et al., 1992), and the new type 3 sequences of thepresent invention HD10, BR33, and BR36 (see FIG. 5, Example 2). Thefollowing sets of primers were synthesized on a 392 DNA/RNA synthesizer(Applied Biosystems): Set 1: HCPr52(+): 5′-atgTTGGGTAAGGTCATCGATACCCT-3′(SEQ ID NO 80) and HCPr54(−): 5′-ctattaCCAGTTCATCATCATATCCCA-3′ (SEQ IDNO 78) Set 2: HCPr41(+): 5′-CCCGGGAGGTCTCGTAGACCGTGCA-3′ (SEQ ID NO 81)and HCPr40(−): 5′-ctattaAAGATAGAGAAAGAGCAACCGGG-3′ (SEQ ID NO 82) Set 3:HCPr41(+): 5′-CCCGGGAGGTCTCGTAGACCGTGCA-3′ (SEQ ID NO 81) and HCPr54(−):5′-ccattaCCAGTTCATCATCATATCCCA-3′ (SEQ ID NO 78)

[0373] The three sets of primers were employed to amplify the regions ofthe type 5 isolate PC as described (Stuyver et al., 1993). Set 1 wasused to amplify the E1 region and yielded fragment PC-4, set 2 wasdesigned to yield the Core region and yielded fragment PC-2. Set 3 wasused to amplify the Core and E1 region and yielded fragment PC-3. Thesefragments were cloned as described (Stuyver et al., 1993). The followingclones were obtained from the PCR fragments:

[0374] From fragment PC-2:

[0375] PC-2-1 (SEQ ID NO 41), PC-2-6 (SEQ ID NO 43),

[0376] From fragment PC-4:

[0377] PC-4-1 (SEQ ID NO 45), PC-4-6 (SEQ ID NO 47),

[0378] From fragment PC-3:

[0379] PC-3-4 (SEQ ID NO 49), PC-3-8 (SEQ ID NO 51)

[0380] An alignment of sequences with SEQ ID NO 41, 43, 45, 47, 49 and51, is given in FIG. 9. A consensus amino acid sequence (PC C/E1; SEQ IDNO 54) can be deduced from each of the 2 clones cloned from each of thethree PCR fragments as depicted in FIG. 5, which overlaps the regionbetween nucleotides I and 957 (Kato et al., 1990). The 6 clones are veryclosely related to each other (mutual homologies of about 99.7%).

[0381] An alignment of nucleotide sequence with SEQ ID NO 53 or 151 (PCC/E1 from isolate BE95) with known nucleotide sequences from the Core/E1region is given in FIG. 3. The clone is only distantly related to type1, type 2, type 3 and type 4 sequences (Table 5).

Example 6 NS3/NS4 region of HCV type 5

[0382] Attempts were undertaken to clone the NS3/NS4 region of theisolate BE95, described in example 5. The folllowing sets of primerswere selected in the regions of little sequence variability afteraligning the sequences of HCV-1 (Choo et al., 1991), HCV-J (Kato et al.,1991), HC-J6 (Okamoto et al., 1991), and HC-J8 (Okamoto et al., 1992)and of the sequences obtained from type 3 sera of the present invention(SEQ ID NO 31, 33, 35, 37 and 39); smaller case lettering is used fornucleotides added for cloning purposes: set A: HCPr116(+):5′-ttttAAATACATCATGRCITGYATG-3′ (SEQ ID NO 66) HCPr66 (−):5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO 70) set B:HCPr116(+): 5′-ttttAAATACATCATGRCITGYATG-3′ (SEQ ID NO 69) HCPr118(−):5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO 71) set C:HCPr117(+): 5′-ttttAAATACATCGCIRCITGCATGCA-3′ (SEQ ID NO 72) HCPr66 (−):5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO 70) set D:HCPr117(+): 5′-ttttAAATACATCGCIRCITGCATGCA-3′ (SEQ ID NO 72) HCPr118(−):5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO 71) set E:HCPr116(+): 5′-ttttAAATACATCATGRCITGYATG-3′ (SEQ ID NO 69) HCPr119(−):actagtcgactaRTTIGCIATIAGCCG/TRTTCATCCAYTG-3′ (SEQ ID NO 73) set F:HCPr117(+): 5′-ttttAAATACATCGCIRCITGCATGCA-3′ (SEQ ID NO 72) HCPr119(−):actagtcgactaRTTIGCIATIAGCCG/TRTTCATCCAYTG-3′ (SEQ ID NO 73) set G:HCPr131(+): 5′-ggaattctagaCCITCITGGGAYGARAYITGGAARTG-3′ (SEQ ID NO 74)HCPr66 (−): 5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO 70) setH: HCPr130(+): 5′-ggaattctagACIGCITAYCARGCIACIGTITGYGC-3′ (SEQ ID NO 75)HCPr66 (−): 5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO 70) setI: HCPr134(+): 5′-CATATAGATGCCCACTTCCTATC-3′ (SEQ ID NO 76) HCPr66 (−):5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO 70) set J:HCPr131(+): 5′-ggaattctagaCCITCITGGGAYGARAYITGGAARTG-3′ (SEQ ID 74)HCPr118(−): 5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO 71)set K: HCPr130(+): 5′-ggaattctagACIGCITAYCARGCIACIGTITGYGC-3′ (SEQ ID NO75) HCPr118(−): 5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO71) set L: HCPr134(+): 5′-CATATAGATGCCCACTTCCTATC-3′ (SEQ ID NO 76)HCPr118(−): 5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO 71)set M: HCPr3(+): 5′-GTGTGCCAGGACCATC-3′ (SEQ ID NO 77) and HCPr4(−):5′-GACATGCATGTCATGATGTA-3′ (SEQ ID NO 78) set N: HCPr3(+):5′-GTGTGCCAGGACCATC-3′ (SEQ ID NO 77) and HCPr118(−):5′-actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ (SEQ ID NO 71) set O:HCPr3(+): 5′-GTGTGCCAGGACCATC-3′ (SEQ ID NO 77) and HCPr66 (−):5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ (SEQ ID NO 70)

[0383] set O yielded what appeared to be a PCR artifact fragmentestimated about 1450 base pairs, instead of the expected 628 base pairs.Although it is not expected that PCR artifact fragments containinformation of the gene or genome that was targetted in the experiment,efforts were put in cloning of this artifact fragment, which wasdesignated fragment PC-1. The following clones, were obtained fromfragment PC-1:

[0384] PC-1-37 (SEQ ID NO 59 and SEQ ID NO 55), PC-1-48 (SEQ ID NO 61and SEQ ID NO 57)

[0385] The sequences obtained from the 5′ and 3′ ends of the clones aregiven in SEQ ID NOS 55, 57, 59, and 61, and the complete sequences withSEQ ID NO 197 and 199 are shown aligned with the sequences of prototypeisolates of other types of HCV in FIG. 10 and the alignment of thededuced amino acid sequences is shown in FIG. 11 and 7. Surprisingly,the PCR artifact clone contained HCV information. The positions of thesequences within the HCV genome are compatible with a contiguous HCVsequence of 1437 nucleotides, which was the estimated size of the clonedPCR artifact fragment. Primer HCPr66 primed correctly at the expectedposition in the HCV genome. Therefore, primer HCPr3 must haveincidentally misprimed at a position 809 nucleotides upstream of itslegitimate position in the HCV genome. This could not be expected sinceno sequence information was available from a coding region of type 5.

Example 7 The E2 region of HCV type 5

[0386] Serum BE95 was chosen for experiments aimed at amplifying a partof the E2 region of HCV type 5.

[0387] After aligning the sequences of HCV-1 (2), HCV-J (1), HC-J6 (3),and HC-J8 (4), PCR primers were chosen in those regions of littlesequence variation.

[0388] Primers HCPr109(+): 5′-TGGGATATGATGATGAACTGGTC-3′ (SEQ ID NO 141)and HCPr14(−): 5′-CCAGGTACAACCGAACCAATTGCC-3′ (SEQ ID NO 142) werecombined to amplify the aminoterminal region of the E2/NS1 region, andwere synthesized on a 392 DNA/RNA synthesizer (Applied Biosystems). Withprimers HCPr109 and HCPr14, a PCR fragment of 661 bp was generated,containing 169 nucleodtides corresponding to the E1 carboxyterminus and492 bases from the region encoding the E2 aminoterminus.

[0389] An alignment of the type 5 E1/E2 sequences with seq ID NO. 158with known sequences is presented in FIG. 10. The deduced proteinsequence was compared with the different genotypes (FIG. 12, amino acids328-546). In the E1 region, there were no extra structural importantmotifs found. The aminoterminal part of E2 was hypervariable whencompared with the other genotypes. All 6 N-glycosylation sites and all 7cysteine residue's were conserved in this E2 region. To preservealignment, it was necessary to introduce a gap between aa 474 and 475 asfor type 3 a, but not between aa 480 and 481, as for type 2.

Example 8 The NS5b region of HCV type 4

[0390] Type 4 sera GB48, GB116, GB215, and GB358, selected by means ofthe line probe assay (LiPA, Stuyver et al., 1993), as well as sera GB549and GB809 that could not be typed by means of this LiPA (onlyhybridization was observed with the universal probes), were selectedfrom Gabonese patients. All these sera were positive after the firstround of PCR reactions for the 5′ untranslated region (Stuyver et al.,1993) and were retained for further study.

[0391] RNA was isolated from the sera and cDNA synthesized as describedin example 1.

[0392] Universal primers in the NS5 region were selected after alignmentof the published sequences as follows: HCPr206(+):5′-TGGGGATCCCGTATGATACCCGCTGCTTTGA-3′ (SEQ ID NO. 124) and HCPr207(−):5′-GGCGGAATTCCTGGTCATAGCCTCCGTGAA-3′ (SEQ ID NO. 125);

[0393] and were synthesized on a 392 DNA/RNA synthesizer (AppliedBiosystems). Using the Line Probe Assay (LiPA), four high-titer type 4sera and 2 sera that could not be classified were selected andsubsequently analyzed with the primer set HCPr206/207. NS5 PCR fragmentsobtained using these primers from serum GB48 (GB48-3), serum GB116(GB116-3), serum GB215 (GB215-3), serum GB358 (GB358-3), serum GB549(GB549-3), and serum GB809 (GB809-3), were selected for cloning. Thefollowing sequences were obtained from the PCR fragments:

[0394] From fragment GB48-3: GB48-3-10 (SEQ ID NO. 106)

[0395] From fragment GB116-3: GB116-3-5 (SEQ ID NO. 108)

[0396] From fragment GB215-3: GB215-3-8 (SEQ ID NO. 110)

[0397] From fragment GB358-3: GB358-3-3 (SEQ ID NO. 112)

[0398] From fragment GB549-3: GB549-3-6 (SEQ ID NO. 114)

[0399] From fragment GB809-3: GB809-3-1 (SEQ ID NO. 116)

[0400] An alignment of nucleotide sequences with SEQ ID NO. 106, 108,110, 112, 114, and 116 with known sequences is given in FIG. 1. Analignment of deduced amino acid sequences with SEQ ID NO. 107, 109, 111,113, 115, and 117 with known sequences is given in FIG. 2. The 4isolates that had been typed as type 4 by means of LiPA are very closelyrelated to each other (mutual homologies of about 95%), but are onlydistantly related to type 1, type 2, and type 3 sequences (e.g. GB358shows homologies of 65.6 to 67.7% with other genotypes, Table 4). Thesequence obtained from sera GB549 and GB809 also show similar homologieswith genotypes 1, 2, and 3 (65.9 to 68.8% for GB549 and 65.0 to 68.5%for GB809, Table 4), but an intermediate homology of 79.7 to 86.8%(often observed between subtypes of the same type) exists between GB549or GB809 with the group of isolates consisting of GB48, GB116, GB215,and GB358, or between GB549 and GB809. These data indicate the discoveryof 3 new subtypes within the HCV genotype 4: in the present invention,these 3 subtypes are designated subtype 4 c, represented by isolatesGB48, GB116, GB215, and GB358, subtype 4 g, represented by isolateGB549, and subtype 4 e, represented by isolate GB809. Although thehomologies observed between subtypes in the NS5 region seem to indicatea closer relationship between subtypes 4 c and 4 e, the homologiesobserved in the E1 region indicate that subtypes 4g and 4 e show theclosest relation (see example 8).

Example 9 The Core/E1 region of HCV type 4

[0401] From each of the 3 new type 4 subtypes, one representative serumwas selected for cloning experiments in the Core/E1 region. GB549(subtype 4 g) and GB809 (subtype 4 e) were analyzed together withisolate GB358 that was chosen from the subtype 4 c group.

[0402] Synthetic oligonucleotides:

[0403] After aligning the sequences of HCV-1 (2), HCV-J (1), HC-J6 (3),and HC-J8 (4), PCR primers were chosen in those regions of littlesequence variation. Primers HCPr52(+): 5′-atgTTGGGTAAGGTCATCGATACCCT-3′,HCPr23(+): 5′-CTCATGGGGTACATTCCGCT-3′, and HCPr54(−):5′-CTATTACCAGTTCATCATCATATCCCA-3′, were synthesized on a 392 DNA/RNAsynthesizer (Applied Biosystems). The sets of primers HCPr23/54 andHCPr52/54 were used, but only with the primer set HCPr52/54, PCRfragments could be obtained. This set of primers amplified the sequencefrom nucleotide 379 to 957 encoding amino acids 127 to 319: 65 aminoacids from the carboxyterminus of core and 128 amino acids of E1. Theamplification products GB358-4, GB549-4, and GB809-4 were cloned asdescribed in example 1. The following clones were obtained from the PCRfragments:

[0404] From fragment GB358-4: GB358-4-1 (SEQ ID NO 118)

[0405] From fragment GB549-4: GB549-4-3 (SEQ ID NO 120)

[0406] From fragment GB809-4: GB809-4-3 (SEQ ID NO 122)

[0407] An alignment of the type 4 Core/E1 nucleotide sequences with seqID NO. 118, 120, and 122 with known sequences is presented in FIG. 4.The homologies of the type 4 E1 region (without core) with type 1, type2, type 3, and type 5 prototype sequences are depicted in Table 4.Homologies of 53 to 66% are observed with representative isolates ofnon-type 4 genotypes. Observed homologies in the E1 region within type4, between the different subtypes, ranges from 75.2 to 78.4%. Therecently disclosed sequences of the core region of Egyptian type 4isolates (for example EG-29 in FIG. 3) described by Simmonds et al.(1993) do not allow alignment with the Gabonese sequences (as describedin the present invention) in the NSB region and may belong to differenttype 4 subtypes(s) as can be deduced from the core sequences. Thededuced amino acid sequences with SEQ ID NO 119, 121, and 123 arealigned with other prototype sequences in FIG. 5. Again, type-specificvariation mainly resides in the variable V regions, designated in thepresent invention, and therefore, type-4-specific amino acids or Vregions will be instrumental in diagnosis and therapeutics for HCV type4.

Example 10 The Core/E1 and NS5b regions of new HCV type 2, 3 and 4subtypes

[0408] Samples NE92 (subtype 2 d), BE98 (subtype 3 c), CAM600 and GB809(subtype 4 e), CAMG22 and CAMG27 (subtype 4 f), GB438 (subtype 4 h),CAR4/1205 subtype (4 i), CAR1/501 (subtype 4 j), CAR1/901 (subtype 4?),and GB724 (subtype 4?) were selected from a group of sera that reactedpositive but aberrantly in a prototype Line Probe Assay as describedearlier (Stuyver et al., 1993). Another type 5 a isolate BE100 was alsoanalyzed in the C/E1 region, and yet another type 5 a isolate BE96 inthe NS5b region. A high-titer of HCV RNA could be detected, enablingcloning of fragments by a single round of PCR. As no sequences from anycoding region of these subtypes had been disclosed yet, syntheticoligonucleotides for PCR amplification were chosen in the regions oflittle sequence variation after aligning the sequences of HCV-1 (Choo etal., 1991), HCV-J (Kato et al., 1990), HC-J6 (Okamoto et al., 1991),HC-J8 (Okamoto et al., 1992), and the other new sequences of the presentinvention.

[0409] The above mentioned sets 1, 2 and 3 (see example 5) of primerswere used, but only with set 1, PCR fragments could be obtained from allisolates (except for BE98, GB724, and CAR1/501). This set of primersamplified the sequence from nucleotide 379 to 957 encoding amino acids127 to 319: 65 amino acids from the carboxyterminus of core and 128amino acids of E1. With set 3, the core/E1 region from isolate NE92 andBE98 could be amplified, and with set 2, the core region of GB358,GB724, GB809, and CAM600 could be amplified. The amplification productswere cloned as described in example 1. The following clones wereobtained from the PCR fragments:

[0410] From isolate GB724, the clone with SEQ ID NO 193 from the coreregion.

[0411] From isolate NE92, the clone with SEQ ID NO 143

[0412] From isolate BE98, the clone from the core/E1 region of whichpart of the sequence has been analyzed and is given in SEQ ID NO 147,

[0413] From isolate CAM600, the clone with SEQ ID NO 167 from the E1region, or SEQ ID NO 165 from the Core/E1 region as shown in FIG. 3,

[0414] From isolate CAMG22, the clone with SEQ ID NO 171 from the E1region as shown in FIG. 4,

[0415] from isolate GB358, the clone with SEQ ID NO 191 in the coreregion,.

[0416] from isolate CAMG27, the clone with SEQ ID NO 173 from thecore/E1 region,

[0417] from isolate GB438, the clone with SEQ ID NO 177 from the core/E1region,

[0418] from isolate CAR4/1205, the clone with SEQ ID NO 179 from thecore/E1 region,

[0419] from isolate CAR1/901, the clone with SEQ ID NO 181 from thecore/E1 region,

[0420] from isolate GB809, the clone GB809-4 with SEQ ID NO 189 from thecore/E1 region, clone GB809-2 with SEQ ID NO 169 from the core/E1 regionand the clone with SEQ ID NO 163 from the core region,

[0421] and from isolate BE100, the clone with SEQ ID NO 155 from theCore/E1 region as shown in FIG. 4.

[0422] An alignment of these Core/E1 sequences with known Core/E1sequences is presented in FIG. 4. The deduced amino acid sequences withSEQ ID NO 144, 148, 164, 168, 170, 172, 174, 178, 180, 182, 190, 192,194, 156, 166 are aligned with other prototype sequences in FIG. 5.Again, type-specific variation mainly resides in the variable V regions,designated in the present invention, and therefore, type 2 d, 3 c andtype 4-specific amino acids or V regions will be instrumental indiagnosis and therapeutics for HCV type (subtype) 2 d, 3 c or thedifferent type 4 subtypes.

[0423] The NS5b region of isolates NE92, BE98, CAM600, CAMG22, GB438,CAR4/1205, CAR1/501, and BE96 was amplified with primers HCPr206 andHCPr207 (Table 7). The corresponding clones were cloned and sequenced asin example 1 and the corresponding sequences (of which BE98 was partlysequenced) received the following identification numbers:

[0424] NE92: SEQ ID NO 145

[0425] BE98: SEQ ID NO 149

[0426] CAM600: SEQ ID NO 201

[0427] CAMG22: SEQ ID NO 203

[0428] GB438: SEQ ID NO 207

[0429] CAR4/1205: SEQ ID NO 209

[0430] CAR1/501: SEQ ID NO 211

[0431] BE95: SEQ ID NO 159

[0432] BE96: SEQ ID NO 161

[0433] An alignment of these NS5b sequences with known NS5b sequences ispresented in FIG. 1. The deduced amino acid sequences with SEQ ID NO146, 150, 202, 204, 206, 208, 210, 212, 160, 162 are aligned with otherprototype sequences in FIG. 2. Again, subtype-specific variations can beobserved, and therefore, type 2 d, 3 c and type 4-specific amino acidsor V regions will be instrumental in diagnosis and therapeutics for HCVtype (subtype) 2 d, 3 c or the different type 4 subtypes.

Example 11 Genotype-specific reactivity of anti-E1 antibodies(Serotyping)

[0434] E1 proteins were expressed from vaccinia virus constructscontaining a core/E1 region extending from nucleotide positions 355 to978 (Core/E1 clones described in previous examples including the primersHCPr52 and HCPr54), and expressed proteins from L119 (after theinitiator methionine) to W326 of the HCV polyprotein. The expressedprotein was modified upon expression in the appropriate host cells (e.g.HeLa, RK13, HuTK-, HepG2) by cleavage between amino acids 191 and 192 ofthe HCV polyprotein and by the addition of high-mannose typecarbohydrate motifs. Therefore, a 30 to 32 kDa glycoprotein could beobserved on western blot by means of detection with serum from patientswith hepatitis C.

[0435] As a reference, a genotype 1 b clone obtained form the isolateHCV-B was also expressed in an identical way as described above, and wasexpressed from recombinant vaccinia virus vvHCV-11A.

[0436] A panel of 104 genotyped sera was first tested for reactivitywith a cell lysate containing type 1 b protein expressed from therecombinant vaccinia virus vvHCV-11A, and compared with cell lysate ofRK13 cells infected with a wild type vaccinia virus (‘E1/WT’). Thelysates were coated as a 1/20 dilution on a normal ELISA microtiterplate (Nunc maxisorb) and left to react with a 1/20 diluation of therespective sera. The panel consisted of 14 type 1 a, 38 type 1 b, 21type 2, 21 type 3 a, and 9 type 4 sera. Human antibodies weresubsequently detected by a goat anti-human IgG conjugated withperoxidase and the enzyme activity was detected. The optical densityvalues of the E1 and wild type lysates were divided and a factor 2 wastaken as the cut-off. The results are given in the table A. Eleven outof 14 type 1 a sera (79%), 25 out of 38 type 1 b sera (66%), 6 out of 21(29%), 5 out of 21 (24%), and none of the 9 type 4 or the type 5 serumreacted (0%). These experiments clearly show the high prevalence ofanti-E1 antibodies reactive with the type 1 E1 protein in patientsinfected with type 1 (36/52 (69%)) (either type 1 a or type 1 b), butthe low prevalence or absence in non-type 1 sera (11/52 (21%)). TABLE Aserum E1/WT type 1a  3748 3.15  3807 3.51  5282 1.99  9321 3.12  93242.76  9325 6.12  9326 10.56  9356 1.79  9388 3.5  8366 10.72  8380 2.2710925 4.02 10936 5.04 10938 1.36 type 1b  5205 2.25  5222 1.33  52461.24  5250 13.58  5493 0.87  5573 1.75  8243 1.77  8244 2.05  8316 1.21 8358 5.04  9337 14.47  9410 5  9413 5.51 10905 1.26 10919 5.00 109288.72 10929 8.26 10931 2.3 10932 4.41   44 2.37   45 3.14   46 4.37   475.68   48 2.97   49 1.18   50 9.85   51 4.51   52 1.11   53 5.20   540.98   55 1.48   56 1.06   57 3.85   58 7.6   59 3.28   60 3.23   617.82   62 1.92 type 2   23 0.91   24 1.16   25 2.51   26 0.96   27 1.20  28 0.96   29 2.58   30 8.05   31 0.92   32 0.82   33 5.75   34 0.79  35 0.86   36 0.85   37 0.76   38 0.92   39 1.08   40 2.33   41 2.83  42 1.21   43 0.91 type 3   1 6.88   2 1.47   3 3.06   4 6.52   5 10.24  6 2.72   7 1.11   8 1.54   9 1.60   10 1.21   11 1.07   12 1.00   130.85   14 0.96   15 0.51   16 1.00   17 1.09   18 0.99   19 1.04   201.04   21 0.96 type 4   22 0.87 GB48 0.49 GB113 0.68 GB116 0.73 GB2150.52 GB358 0.56 GB359 0.71 GB438 1.08 GB516 1.04 type 5 BE95 0.86

[0437] Core/E1 clones of isolates BR36 (type 3 a) and BE95 (type 5 a)were subsequently recombined into the viruses wHCV-62 and vvHCV-63,respectively. A genotyped panel of sera was subsequently tested ontocell lysates obtained from RK13 cells infected with the recombinantviruses vvHCV-62 and wHCV-63. Tests were carried out as described aboveand the results are given in the table given below (TABLE B). From theseresults, it can clearly be seen that, although some cross-reactivityoccurs (especially between type 1 and 3), the obtained values of a givenserum are usually higher on its homologous E1 protein than on an E1protein of another genotype. For type 5 sera, none of the 5 sera werereactive on type 1 or 3 E1 proteins, while 3 out of 5 were shown tocontain anti-E1 antibodies when tested on their homologous type 5protein. Therefore, in this simple test system, a considerable number ofsera can already be serotyped. Combined with the reactivity totype-specific NS4 epitopes or epitopes derived from other type-specificparts of the HCV polyprotein, a serotyping assay may be developed fordiscriminating the major types of HCV. To overcome the problem ofcross-reactivity, the position of cross-reactive epitopes may bedetermined by someone skilled in the art (e.g. by means of competitionof the reactivity with synthetic peptides), and the epitopes evokingcross-reactivity may be left out of the composition to be included inthe serotyping assay or may be included in sample diluent to outcompetecross-reactive antibodies. TABLE B serum E1^(1b)/WT E1^(3a)/WTE1^(5a)/WT type 1b 8316 0.89 0.59 0.80 8358 2.22 2.65 1.96 9337 1.590.96 0.93 9410 16.32 9.60 3.62 9413 9.89 2.91 2.85 10905 1.04 0.96 1.0510919 3.17 2.56 2.96 10928 4.39 2.28 2.07 10929 2.95 2.07 2.08 109313.11 1.49 2.11 5 0.86 0.86 0.96 6 3.48 1.32 1.32 7 6.76 4.00 3.77 810.88 3.44 4.04 9 1.76 1.88 1.58 10 9.88 7.48 7.20 11 8.48 8.99 8.45 120.76 0.72 0.76 13 5.04 5.67 5.37 14 10.48 10.54 11.22 15 5.18 1.62 1.65type 3 8332 3.39 4.22 0.66 10907 3.24 4.39 0.96 10908 0.99 0.94 0.9810934 0.86 0.90 0.90 10927 2.58 2.71 2.44 8210 0.82 0.80 0.86 8344 1.096.66 1.17 8351 1.21 1.29 1.22 30 0.85 4.11 0.98 32 0.85 2.16 1.04 type 50.78 0.95 1.54 BE110 0.79 1.01 4.95 BE95 0.47 0.52 0.65 BE111 0.71 0.758.33 BE112 1.01 1.27 2.37 BE113 1.11 1.35 1.60

[0438] TABLE 5 Homologies of new HCV sequences with other known HCVtypes Region Isolate 1a 1b 2a 2b 3a 3b (nucleotides) (type) HCV-1 HCV-JHC-J6 HC-J8 T1 T7 T9 T10 Core (1-573) PC (5) 83.8 (91.6) 84.8 (92.1)82.6 (90.1) 82.4 (89.0) E1 (574-957) HD10 (3) 61.5 (68.0) 64.6 (68.8)57.8 (55.5) 56.3 (59.4) BR36 (3) 62.0 (66.4) 62.5 (67.2) 56.5 (53.9)55.2 (58.6) BR33 (3) 60.7 (67.2) 63.3 (68.0) 56.5 (54.7) 56.0 (58.6) PC(5) 61.4 (64.0) 62.4 (64.8) 54.1 (49.6) 53.3 (47.2) GB358 (4a) 62.5(69.1) 62.8 (65.9) 59.4 (54.0) 54.4 (54.0) GB549 (4b) 66.0 (72.2) 62.8(69.8) 59.1 (56.4) 56.5 (54.0) GB809 (4c) 63.3 (69.1) 60.7 (64.3) 56.7(53.2) 53.0 (51.6) NS3 PC (5) 74.7 (89)   76.1 (86.4) 76.1 (89.8) 78.0(89.0) (3856-4209) NS4 BR36 (3) 67.8 (78.5) 69.8 (75.1) 62.0 (67.5) 61.7(66.0) (4892-5292) HD 10 (3) 69.8 (74.6) 66.6 (69.7) 57.8 (59.9) 59.1(59.9) NS4 PC (5) 61.3 (62.2) 63.0 (65.5) 52.9 (46.2) 54.3 (43.7)(4936-5292) NS5b BR34 (3) 65.7     66.7     63.9     64.3     94.8 93.975.6 77.0 (8023-8235) BR36 (3) 64.3     67.6     64.8     66.7     94.893.4 75.1 76.5 BR33 (3) 65.7     67.1     64.3     64.8     94.8 93.976.0 77.5 GB358 (4a) 67.7 (76.1) 65.6 (77.0) 66.5 (70.8) 65.6 (71.7)GB549 (4b) 68.8 (76.1) 67.1 (77.0) 65.9 (71.7) 65.9 (74.4) GB809 (4c)68.5 (73.5) 65.0 (73.5) 67.7 (69.9) 67.7 (73.5)

[0439] Shown are the nucleotide homologies (the amino-acid homology isgiven between brackets) for the region indicated in the left column.TABLE 6 NS4 sequences of the different genotypes SYNTHETIC PEPTIDE NS4-1SYNTHETIC PEPTIDE NS4-5 SYNTHETIC PEPTIDE NS4-7 prototype TYPE (NS4a)(NS4b) (NS4b) position-> 1 1 1 1 1 1 6 7 7 7 7 7 9 0 2 3 3 4 0 0 0 0 0 0        * *  *    **   **       *       *   *  *  *    *   *  ** HCV-11a LSG KPAIIPDREV   LYREFDE SQHLPYIEQ GMMLAEQFKQ KLAEQFKQ  KALGLLQTAS RQA HCV-J 1b LSG RPAVIPDREV  LYQEFDEASHLPYIEQ GMQLAEQFKQ K LAEQFKQ  KALGLLQTAT KQA HC-J6 2aVNQ RAVVAPDKEV LYEAFDE ASRAALIEE  GQRIAEMLKS K IAEMLKS  KIQGLLQQAS KQAHC-J8 2b LND RVVVAPDKEI  LYEAFDE ASKAALIEE GQRMAEMLKS KMAEMLKS KIQGLLQQAT  RQA BR36 3a LGG KPAIVPDKEV LYQQ YDESQAAPYIEQ  AQVIAHQFKE K IAHQFKE KVLGLLQRAT QQQ PC 5LSG KFAIIPDREA  LYQQFDE AASLPYMDE  TRAIAGQFKE K IAGQFKE KVLGFISTTG  Q KA             V

[0440] TABLE 7 SEQ ID Primer NO NO (polarity) Sequence from 5′ to 3′ 63HCPr161(+) 5′-ACCGGAGGCCAGGAGAGTGATCTCCTCC-3′ 64 HCPr162(−)5′-GGGCTGCTCTATCCTCATCGACGCCATC-3′ 65 HCPr163(+)5′-GCCAGAGGCTCGGAAGGCGATCAGCGCT-3′ 66 HCPr164(−)5′-GAGCTGCTCTGTCCTCCTCGACGCCGCA-3′ 67 HCPr23(+)5′-CTCATGGGGTACATTCCGCT-3′ 68 HCPr54(−)5′-CTATTACCAGTTCATCATCATATCCCA-3′ 69 HCPr116(+)5′-ttttAAATACATCATGRCITGYATG-3′ 70 HCPr66(−)5′-ctattaTTGTATCCCRCTGATGAARTTCCACAT-3′ 71 HCPr118(−)5′actagtcgactaYTGIATICCRCTIATRWARTTCCACAT-3′ 72 HCPr117(+)5′-ttttAAATACATCGCIRCITGCATGCA-3′ 73 HCPr119(−)5′-actagtcgactaRTTIGCIATIAGCCKRTTCATCCAYTG-3′ 74 HCPr131(+)5′-ggaattctagaCCITCITGGGAYGARAYITGGAARTG-3′ 75 HCPr130(+)5′-ggaattctagACIGCITAYCARGCIACIGTITGYGC-3′ 76 HCPr134(+)5′-CATATAGATGCCCACTTCCTATC-3′ 77 HCPr3(+) 5′-GTGTGCCAGGACCATC-3′ 78HCPr4(−) 5′-GACATGCATGTCATGATGTA-3′ 79 HCPr152(+)5′-TACGCCTCTTCTATATCGGTTGGGGCCTG-3′ 80 HCPr52(+)5′-atgTTGGGTAAGGTCATCGATACCCT-3′ 81 HCPr41(+)5′-CCCGGGAGGTCTCGTAGACCGTGCA-3′ 82 HCPr40(−)5′-ctattaAAGATAGAGAAAGAGCAACCGGG-3′ 124  HCPR2065′-tggggatcccgtatgatacccgctgctttga-3′ 125  HCPR2075′-ggcggaattcctggtcatagcctccgtgaa-3′ 141  HCPR1095′-tgggatatgatgatgaactggtc-3′ 142  HCPR14 5′-ccaggtacaaccgaaccaattgcc-3′

[0441] TABLE 8 NS4 SEROTYPING Type 1 NS4 Type 2 NS4 Type 3 NS4 serum 1 57 1 5 7 1 5 7 type 1a 101 3 3 3 − 1 3 +/− +/− 3 102 1 +/− 2 − − 2 − − 1103 1 3 3 − +/− 3 − +/− 3 104 3 3 3 2 2 3 3 +/− 2 105 3 3 3 − 2 2 +/−+/− 2 106 3 1 1 − 1 2 +/− +/− +/− 107 3 3 3 − 2 2 2 − 1 108 3 3 3 − +/−2 +/− 1 2 109 3 3 3 +/− 2 3 1 − 3 110 3 3 3 − +/− 1 − − 3 type 1b 111+/− +/− − − − − − − − 112 − 2 3 − − 2 − − 3 113 2 3 3 − − 1 − − 3 114 23 3 1 + 2 + 1 3 115 3 3 3 − + 3 − − 3 116 3 3 3 − +/− 1 − − 1 117 3 − −3 +/− +/− +/− − − 118 1 2 3 − +/− 2 − +/− 3 119 +/− 2 2 +/− +/− 2 + 1 2120 − 3 3 −3 +/− +/− − − − 121 3 3 3 +/− 2 2 2 2 3 122 3 3 1 − 1 2 2 1 1123 3 3 2 − 1 2 − 1 1 124 3 3 3 +/− 2 − − 2 125 3 3 3 1 1 3 2 1 3 126 12 2 1 1 1 1 1 1 127 3 2 +/− − +/− 1 +/− +/− +/− 128 3 3 3 − +/− 1 2 +/−+/− 129 2 3 3 − − 3 − − 3 130 − 2 1 +/− − − − − − 131 − 1 1 − − − − −+/− 132 − − − +/− − +/− +/− − − 133 3 3 3 − 1 3 − 1 3 134 − 2 2 − − − −− − 135 3 3 3 1 + 2 2 1 3 136 − 3 3 +/− +/− +/− +/− − 3 137 +/− +/− +/−+/− +/− +/− +/− − − 138 3 3 3 +/− 2 2 1 + 3 type 2a 139 3 − − 3 3 +/− 1− − 140 +/− − − 3 3 3 3 − − 141 2 − − 2 1 +/− 2 − − 142 − − − − +/− − −− − 143 − +/− +/− 1 2 1 1 +/− +/− 144 1 1 + 1 3 2 1 1 2 145 − +/− +/− 31 2 2 +/− +/− 146 − − − +/− +/− − − − − 147 − +/− − 3 1 3 − − − 148 − −− +/− − − +/− − − type 2b 149 − +/− +/− 3 3 1 2 +/− +/− type 3 150 +/−+/− +/− +/− +/− +/− 1 3 3 151 − − − − − − 2 − 2 152 +/− − − − − − 3 − −153 − − − − − − − 1 − 154 +/− 1 3 − +/− 2 2 1 3 155 − 2 3 − 2 2 1 1 3156 − − − − − − − − − 157 − − − +/− +/− − +/− 2 2 158 2 − − − 1 2 3 2 2159 − − − − +/− +/− − 3 3 160 − − − − +/− − − 2 3 161 − − − − 1 1 +/− 32 type 4 162 1 − − − − − − − − 163 2 − − − +/− +/− +/− − −

[0442] REFERENCES

[0443] Barany F (1991). Genetic disease detection and DNA amplificationusing cloned thermostable ligase. Proc Natl Acad Sci USA 88: 189-193.

[0444] Bej A, Mahbubani M, Miller R, Di Cesare J, Haff L, Atlas R (1990)Mutiplex PCR amplification and immobilized capture probes for detectionof bacterial pathogens and indicators in water. Mol Cell Probes4:353-365.

[0445] Bukh J, Purcell R, Miller R (1992). Sequence analysis of the 5′noncoding region of hepatitis C virus. Proc Natl Acad Sci USA89:4942-4946.

[0446] Bukh J, Purcell R, Miller R (1993). At least 12 genotypes . . .PNAS 90,8234-8238.

[0447] Cha T, Beal E, Irvine B, Kolberg J, Chien D, Kuo G, Urdea M(1992) At least five related, but distinct, hepatitis C viral genotypesexist. Proc Natl Acad Sci USA 89:7144-7148.

[0448] Chan S-W, Simmonds P, McOmish F, Yap P, Mitchell R, Dow B,Follett E (1991) Serological responses to infection with three differenttypes of hepatitis C virus. Lancet 338:1991.

[0449] Chan S-W, McOmish F, Holmes E, Dow B, Peutherer J, Follett E, YapP, Simmonds P (1992) Analysis of a new hepatitis C virus type and itsphylogenetic relationship to existing variants. J Gen Virol73:1131-1141.

[0450] Chomczynski P, Sacchi N (1987) Single step method of RNAisolation by acid guanidinium thiocyanate-phenol-chloroform extraction.Anal Biochem 162:156-159.

[0451] Choo Q, Richman K, Han J, Berger K, Lee C, Dong C, Gallegos C,Coit D, Medina-Selby A, Barr P, Weiner A, Bradley D, Kuo G, Houghton M(1991) Genetic organization and diversity of the hepatitis C virus. ProcNatl Acad Sci USA 88:2451-2455.

[0452] Compton J (1991). Nucleic acid sequence-based amplification.Nature, 350: 91-92.

[0453] Duchosal A, Eming S, Fisher P (1992) Immunization of hu-PBL-SCIDmice and the resue of human monoclonal Fab fragments throughcombinatorial libraries. Nature 355:258-262.

[0454] Duck P (1990). Probe amplifier system based on chimeric cyclingoligonucleotides. Biotechniques 9, 142-147.

[0455] Guatelli J, Whitfield K, Kwoh D, Barringer K, Richman D, GengerasT (1990) Isothermal, in vitro amplification of nucleic acids by amultienzyme reaction modeled after retroviral replication. Proc NatlAcad Sci USA 87: 1874-1878.

[0456] Hijikata M, Kato N, Ootsuyama Y, Nakagawa M, Shimotohmo K (1991)Gene mapping of the putative structural region of the hepatitis C virusgenome by in vitro processing analysis. Proc Natl Acad Sci USA 88,5547-5551.

[0457] Jacobs K, Rudersdorf R, Neill S, Dougherty J, Brown E, Fritsch E(1988) The thermal stability of oligonucleotide duplexes is sequenceindependent in tetraalkylammonium salt solutions: application toidentifying recombinant DNA clones. Nucl Acids Res 16:4637-4650.

[0458] Kato N, Hijikata M, Ootsuyama Y, Nakagawa M, Ohkoshi S, SugimuraT, Shirnotohno K (1990) Molecular cloning of the human hepatitis C virusgenome from Japanese patients with non-A, non-B hepatitis. Proc NatlAcad Sci USA 87:9524-9528.

[0459] Kwoh D, Davis G, Whitfield K, Chappelle H, Dimichele L, GingerasT (1989). Transcription-based amplification system and detection ofamplified human immunodeficiency virus type 1 with a bead-based sandwichhybridization format. Proc Natl Acad Sci USA, 86: 1173-1177.

[0460] Kwok S, Kellogg D, McKinney N, Spasic D, Goda L, Levenson C,Sinisky J, (1990). Effects of primer-template mismatches on thepolymerase chain reaction: Human immunodeficiency views type 1 modelstudies. Nucl. Acids Res., 18: 999.

[0461] Landgren U, Kaiser R, Sanders J, Hood L (1988). A ligase-mediatedgene detection technique. Science 241:1077-1080.

[0462] Lizardi P, Guerra C, Lomeli H, Tussie-Luna I, Kramer F (1988)Exponential amplification of recombinant RNA hybridization probes.Bio/Technology 6:1197-1202.

[0463] Lomeli H, Tyagi S, Printchard C, Lisardi P, Kramer F (1989)Quantitative assays based on the use of replicatable hybridizationprobes. Clin Chem 35: 1826-1831.

[0464] Machida A, Ohnuma H, Tsuda F, Munekata E, Tanaka T, Akahane Y,Okamoto H, Mishiro S (1992) Hepatology 16, 886-891.

[0465] Maniatis T, Fritsch E, Sambrook J (1982) Molecular cloning: alaboratory manual. Cold Spring Harbor Laboratory Press, Cold SpringHarbor, N.Y.

[0466] Mori S, Kato N, Yagyu A, Tanaka T, Ikeda Y, Petchclai B,Chiewsilp P, Kurimura T, Shimotohno K (1992) A new type of hepatitis Cvirus in patients in Thailand. Biochem Biophys Res Comm 183:334-342.

[0467] Okamoto H, Okada S, Sugiyama Y, Kurai K, Iizuka H. Machida A,Miyakawa Y, Mayumi M (1991) Nucleotide sequence of the genomic RNA ofhepatitis C virus isolated from a human carrier: comparison withreported isolates for conserved and divergent regions. J Gen Virol72:2697-2704.

[0468] Okamoto H, Kurai K, Okada S, Yamamoto K, Lizuka H, Tanaka T,Fukuda S, Tsuda F, Mishiro S (1992) Full-length sequences of a hepatitisC virus genome having poor homology to reported isolates: comparativestudy of four distinct genotypes. Virology 188:331-341.

[0469] Persson M, Caothien R, Burton D (1991) Generation of diversehigh-affinity human monoclonal antibodies by repertoire cloning. ProcNatl Acad Sci USA 89:2432-2436. Saiki R, Gelfand D, Stoffel S, Scharf S,Higuchi R, Horn G, Mullis K, Erlich H (1988). Primer-directed enzymaticamplification of DNA with a thermostable DNA polymerase. Science239:487-491.

[0470] Saiki R, Walsh P, Levenson C, Erlich H (1989) Genetic analysis ofamplified DNA with immobilized sequence-specific oligonucleotide probes(1989) Proc Natl Acad Sci USA

[0471] Sano T, Smith C, Cantor C (1992) Immuno-PCR: very sensitiveantigen detection by means of specific antibody-DNA conjugates. Science258:120-122.

[0472] Simmonds P, McOmsh F, Yap P, Chan S, Lin C, Dusheiko G, Saeed A,Holmes E (1993), Sequence variability in the 5′ non-coding region ofhepatitis C virus: identification of a new virus type and restrictionson sequence diversity. J Gen Virology, 74:661-668.

[0473] Stuyver L, Rossau R, Wyseur A, Duhamel M, Vanderborght B, VanHeuverswyn H, Maertens G (1993) Typing of hepatitis C virus (HCV)isolates and characterization of new (sub)types using a Line ProbeAssay. J Gen Virology, 74: 1093-1102.

[0474] Walker G, Little M, Nadeau J, Shank D (1992). Isothermal in vitroamplification of DNA by a restriction enzyme/DNA polymerase system. ProcNatl Acad Sci USA 89:392-396.

[0475] Wu D, Wallace B (1989). The ligation amplification reaction(LAR)-amplification of specific DNA sequences using sequential rounds oftemplate-dependent ligation. Genomics 4:560-569

0 SEQUENCE LISTING (1) GENERAL INFORMATION: (iii) NUMBER OF SEQUENCES:270 (2) INFORMATION FOR SEQ ID NO: 1: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 213 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO(iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE: BR34-4-20 (ix)FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..213 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 1: CTC ACG GAA CGG CTT TAC TGC GGG GGC CCT ATGTTC AAC AGC AAG GGG 48 Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met PheAsn Ser Lys Gly 1 5 10 15 GCC CAG TGT GGT TAT CGC CGC TGC CGT GCC AGTGGA GTT CTG CCT ACC 96 Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser GlyVal Leu Pro Thr 20 25 30 AGC TTC GGC AAC ACA ATC ACT TGC TAC ATC AAG GCCACA GCG GCT GCA 144 Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala ThrAla Ala Ala 35 40 45 AGG GCC GCA GGC CTC CGG AAC CCG GAC TTT CTT GTC TGCGGA GAT GAT 192 Arg Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cys GlyAsp Asp 50 55 60 CTG GTC GTG GTG GCT GAG AGT 213 Leu Val Val Val Ala GluSer 65 70 (2) INFORMATION FOR SEQ ID NO: 2: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 71 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 2: Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn SerLys Gly 1 5 10 15 Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly ValLeu Pro Thr 20 25 30 Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala ThrAla Ala Ala 35 40 45 Arg Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val CysGly Asp Asp 50 55 60 Leu Val Val Val Ala Glu Ser 65 70 (2) INFORMATIONFOR SEQ ID NO: 3: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 213 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (vii) IMMEDIATE SOURCE: (B) CLONE:BR36-23-18 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..213 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 3: CTC ACG GAA CGG CTT TAC TGC GGG GGCCCT ATG TTC AAC AGC AAG GGG 48 Leu Thr Glu Arg Leu Tyr Cys Gly Gly ProMet Phe Asn Ser Lys Gly 1 5 10 15 GCC CAG TGT GGT TAT CGC CGC TGC CGTGCC AGT GGA GTT CTG CCT ACC 96 Ala Gln Cys Gly Tyr Arg Arg Cys Arg AlaSer Gly Val Leu Pro Thr 20 25 30 AGC TTC GGC AAC ACA ATC ACT TGC TAC ATCAAG GCC ACA GCG GCT GCA 144 Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile LysAla Thr Ala Ala Ala 35 40 45 AGG GCC GCA GGC CTC CGG AAC CCG GAC TTT CTTGTC TGC GGA GAT GAT 192 Arg Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu ValCys Gly Asp Asp 50 55 60 CTG GTC GTG GTG GCT GAG AGT 213 Leu Val Val ValAla Glu Ser 65 70 (2) INFORMATION FOR SEQ ID NO: 4: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 71 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 4: Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn SerLys Gly 1 5 10 15 Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly ValLeu Pro Thr 20 25 30 Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala ThrAla Ala Ala 35 40 45 Arg Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val CysGly Asp Asp 50 55 60 Leu Val Val Val Ala Glu Ser 65 70 (2) INFORMATIONFOR SEQ ID NO: 5: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 213 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE:NO (vii) IMMEDIATE SOURCE: (B) CLONE: BR36-23-18 (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..213 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: CTC ACG GAG CGG CTT TAC TGC GGG GGC CCT ATG TTT AAC AGC AAG GGG 48Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly 1 5 1015 GCC CAG TGT GGT TAT CGC CGT TGC CGT GCC AGT GGA GTT CTG CCT ACC 96Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 20 25 30AGC TTC GGC AAC ACA ATC ACT TGT TAC ATC AAA GCC ACA GCG GCC GCA 144 SerPhe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Ala Ala Ala 35 40 45 AAAGCC GCA GGC CTC CGG AGC CCG GAC TTT CTT GTC TGC GGA GAT GAT 192 Lys AlaAla Gly Leu Arg Ser Pro Asp Phe Leu Val Cys Gly Asp Asp 50 55 60 CTG GTCGTG GTG GCT GAG AGT 213 Leu Val Val Val Ala Glu Ser 65 70 (2)INFORMATION FOR SEQ ID NO: 6: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:71 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: Leu Thr Glu ArgLeu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly 1 5 10 15 Ala Gln CysGly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 20 25 30 Ser Phe GlyAsn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Ala Ala Ala 35 40 45 Lys Ala AlaGly Leu Arg Ser Pro Asp Phe Leu Val Cys Gly Asp Asp 50 55 60 Leu Val ValVal Ala Glu Ser 65 70 (2) INFORMATION FOR SEQ ID NO: 7: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 213 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:BR36-23-20 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..213 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 7: CTC ACG GAG CGG CTT TAC TGC GGG GGCCCT ATG TTT AAC AGC AAA GGG 48 Leu Thr Glu Arg Leu Tyr Cys Gly Gly ProMet Phe Asn Ser Lys Gly 1 5 10 15 GCC CAG TGT GGT TAT CGC CGT TGC CGTGCC AGT GGA GTT CTG CCT ACC 96 Ala Gln Cys Gly Tyr Arg Arg Cys Arg AlaSer Gly Val Leu Pro Thr 20 25 30 AGC TTC GGC AAC ACA ATC ACT TGT TAC ATCAAA GCC ACA GCG GCC GCA 144 Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile LysAla Thr Ala Ala Ala 35 40 45 AAA GCC GCA GGC CTC CGG AGC CCG GAC TTT CTTGTC TGC GGA GAT GAT 192 Lys Ala Ala Gly Leu Arg Ser Pro Asp Phe Leu ValCys Gly Asp Asp 50 55 60 CTG GTC GTG GTG GCT GAG AGT 213 Leu Val Val ValAla Glu Ser 65 70 (2) INFORMATION FOR SEQ ID NO: 8: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 71 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 8: Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn SerLys Gly 1 5 10 15 Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly ValLeu Pro Thr 20 25 30 Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala ThrAla Ala Ala 35 40 45 Lys Ala Ala Gly Leu Arg Ser Pro Asp Phe Leu Val CysGly Asp Asp 50 55 60 Leu Val Val Val Ala Glu Ser 65 70 (2) INFORMATIONFOR SEQ ID NO: 9: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 213 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE:NO (vii) IMMEDIATE SOURCE: (B) CLONE: BR33-2-17 (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..213 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: CTC ACG GAG CGG CTT TAC TGC GGG GGC CCT ATG TTC AAC AGC AAG GGG 48Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly 1 5 1015 GCC CAG TGT GGT TAT CGC CGT TGT CGT GCC AGT GGA GTT CTG CCT ACC 96Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 20 25 30AGT TTC GGC AAC ACA ATC ACT TGT TAC ATC AAG GCC ACA GCG GCT GCA 144 SerPhe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Ala Ala Ala 35 40 45 AAAGCC GCA GGC CTC CGG AAC CCG GAC TTT CTT GTT TGC GGA GAT GAT 192 Lys AlaAla Gly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp 50 55 60 TTG GTCGTG GTG GCT GAG AGT 213 Leu Val Val Val Ala Glu Ser 65 70 (2)INFORMATION FOR SEQ ID NO: 10: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:71 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: Leu Thr Glu ArgLeu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly 1 5 10 15 Ala Gln CysGly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr 20 25 30 Ser Phe GlyAsn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Ala Ala Ala 35 40 45 Lys Ala AlaGly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp 50 55 60 Leu Val ValVal Ala Glu Ser 65 70 (2) INFORMATION FOR SEQ ID NO: 11: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 213 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:BR33-2-21 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..213 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 11: CTC ACG GAG CGG CTT TAC TGC GGG GGCCCT ATG TTC AAC AGC AAG GGG 48 Leu Thr Glu Arg Leu Tyr Cys Gly Gly ProMet Phe Asn Ser Lys Gly 1 5 10 15 GCC CAG TGT GGT TAT CGC CGT TGT CGTGCC AGT GGA GTT CTG CCT ACC 96 Ala Gln Cys Gly Tyr Arg Arg Cys Arg AlaSer Gly Val Leu Pro Thr 20 25 30 AGT TTC GGC AAC ACA ATC ACT TGT TAC ATCAAG GCC ACA GCG GCT GCA 144 Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile LysAla Thr Ala Ala Ala 35 40 45 AAA GCC GCA GGC CTC CGG AAC CCG GAC TTT CTTGTT TGC GGA GAT GAT 192 Lys Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu ValCys Gly Asp Asp 50 55 60 TTG GTC GTG GTG GCT GAG AGT 213 Leu Val Val ValAla Glu Ser 65 70 (2) INFORMATION FOR SEQ ID NO: 12: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 71 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 12: Leu Thr Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn SerLys Gly 1 5 10 15 Ala Gln Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly ValLeu Pro Thr 20 25 30 Ser Phe Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala ThrAla Ala Ala 35 40 45 Lys Ala Ala Gly Leu Arg Asn Pro Asp Phe Leu Val CysGly Asp Asp 50 55 60 Leu Val Val Val Ala Glu Ser 65 70 (2) INFORMATIONFOR SEQ ID NO: 13: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 541 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE:NO (vii) IMMEDIATE SOURCE: (B) CLONE: HD10-2-5 (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 2..541 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: C GTC GGC GCT CCT GTA GGA GGC GTC GCA AGA GCC CTT GCG CAT GGC 46 ValGly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly 1 5 10 15 GTGAGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA GGG AAT TTG CCC 94 Val ArgAla Leu Glu Asp Gly Ile Asn Phe Ala Thr Gly Asn Leu Pro 20 25 30 GGT TGCTCC TTT TCT ATC TTC CTT CTT GCT CTG TTC TCT TGC TTA ATC 142 Gly Cys SerPhe Ser Ile Phe Leu Leu Ala Leu Phe Ser Cys Leu Ile 35 40 45 CAT CCA GCAGCT AGT CTA GAG TGG CGG AAC ACG TCT GGC CTC TAT GTC 190 His Pro Ala AlaSer Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val 50 55 60 CTT ACC AAC GACTGT TCC AAT AGC AGT ATT GTG TAT GAG GCC GAT GAC 238 Leu Thr Asn Asp CysSer Asn Ser Ser Ile Val Tyr Glu Ala Asp Asp 65 70 75 GTT ATT CTG CAC ACACCC GGC TGT GTA CCT TGT GTT CAG GAC GGT AAT 286 Val Ile Leu His Thr ProGly Cys Val Pro Cys Val Gln Asp Gly Asn 80 85 90 95 ACA TCT GCG TGC TGGACC CCA GTG ACA CCT ACA GTG GCA GTC AGG TAC 334 Thr Ser Ala Cys Trp ThrPro Val Thr Pro Thr Val Ala Val Arg Tyr 100 105 110 GTC GGA GCA ACC ACCGCT TCG ATA CGC AGG CAT GTA GAC ATG TTG GTG 382 Val Gly Ala Thr Thr AlaSer Ile Arg Arg His Val Asp Met Leu Val 115 120 125 GGC GCG GCC ACG ATGTGC TCT GCT CTC TAC GTG GGT GAT ATG TGT GGG 430 Gly Ala Ala Thr Met CysSer Ala Leu Tyr Val Gly Asp Met Cys Gly 130 135 140 GCC GTC TTC CTC GTGGGA CAA GCC TTC ACG TTC AGA CCT CGT CGC CAT 478 Ala Val Phe Leu Val GlyGln Ala Phe Thr Phe Arg Pro Arg Arg His 145 150 155 CAA ACG GTC CAG ACCTGT AAC TGC TCA CTG TAC CCA GGC CAT CTT TCA 526 Gln Thr Val Gln Thr CysAsn Cys Ser Leu Tyr Pro Gly His Leu Ser 160 165 170 175 GGA CAC CGA ATGGCT 541 Gly His Arg Met Ala 180 (2) INFORMATION FOR SEQ ID NO: 14: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 180 amino acids (B) TYPE: aminoacid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 14: Val Gly Ala Pro Val Gly Gly Val Ala Arg AlaLeu Ala His Gly Val 1 5 10 15 Arg Ala Leu Glu Asp Gly Ile Asn Phe AlaThr Gly Asn Leu Pro Gly 20 25 30 Cys Ser Phe Ser Ile Phe Leu Leu Ala LeuPhe Ser Cys Leu Ile His 35 40 45 Pro Ala Ala Ser Leu Glu Trp Arg Asn ThrSer Gly Leu Tyr Val Leu 50 55 60 Thr Asn Asp Cys Ser Asn Ser Ser Ile ValTyr Glu Ala Asp Asp Val 65 70 75 80 Ile Leu His Thr Pro Gly Cys Val ProCys Val Gln Asp Gly Asn Thr 85 90 95 Ser Ala Cys Trp Thr Pro Val Thr ProThr Val Ala Val Arg Tyr Val 100 105 110 Gly Ala Thr Thr Ala Ser Ile ArgArg His Val Asp Met Leu Val Gly 115 120 125 Ala Ala Thr Met Cys Ser AlaLeu Tyr Val Gly Asp Met Cys Gly Ala 130 135 140 Val Phe Leu Val Gly GlnAla Phe Thr Phe Arg Pro Arg Arg His Gln 145 150 155 160 Thr Val Gln ThrCys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser Gly 165 170 175 His Arg MetAla 180 (2) INFORMATION FOR SEQ ID NO: 15: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 541 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS:single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL:NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE: HD10-2-14(ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..541 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 15: C GTC GGC GCT CCT GTA GGA GGC GTC GCA AGAGCC CTT GCG CAT GGC 46 Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala LeuAla His Gly 1 5 10 15 GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACAGGG AAT TTG CCC 94 Val Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr GlyAsn Leu Pro 20 25 30 GGT TGC TCC TTT TCT ATC TTC CTT CCT GCT CTG TTC TCTTGC TTA ATC 142 Gly Cys Ser Phe Ser Ile Phe Leu Pro Ala Leu Phe Ser CysLeu Ile 35 40 45 CAT CCA GCA GCT AGT CTA GAG TGG CGG AAC ACG TCT GGC CTCTAT GTC 190 His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser Gly Leu TyrVal 50 55 60 CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAT GAG GCC GATGAC 238 Leu Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala Asp Asp65 70 75 GTT ATT CTG CAC ACA CCC GGC TGT GTA CCT TGT GTT CAG GAC GGT AAT286 Val Ile Leu His Thr Pro Gly Cys Val Pro Cys Val Gln Asp Gly Asn 8085 90 95 ACA TCT GCG TGC TGG ACC CCA GTG ACA CCT ACA GTG GCA GTC AGG TAC334 Thr Ser Ala Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr 100105 110 GTC GGA GCA ACC ACC GCT TCG ATA CGC AGG CAT GTA GAC ATA TTG GTG382 Val Gly Ala Thr Thr Ala Ser Ile Arg Arg His Val Asp Ile Leu Val 115120 125 GGC GCG GCC ACA ATG TGC TCT GCT CTC TAC GTG GGT GAT ATG TGT GGG430 Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly 130135 140 GCC GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA CCT CGT CGC CAT478 Ala Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg Pro Arg Arg His 145150 155 CAA ACG GTC CAG ACC TGT AAC TGC TCA CTG TAC CCA GGC CAT CTT TCA526 Gln Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser 160165 170 175 GGA CAC CGA ATG GCT 541 Gly His Arg Met Ala 180 (2)INFORMATION FOR SEQ ID NO: 16: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:180 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: Val Gly Ala ProVal Gly Gly Val Ala Arg Ala Leu Ala His Gly Val 1 5 10 15 Arg Ala LeuGlu Asp Gly Ile Asn Phe Ala Thr Gly Asn Leu Pro Gly 20 25 30 Cys Ser PheSer Ile Phe Leu Pro Ala Leu Phe Ser Cys Leu Ile His 35 40 45 Pro Ala AlaSer Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val Leu 50 55 60 Thr Asn AspCys Ser Asn Ser Ser Ile Val Tyr Glu Ala Asp Asp Val 65 70 75 80 Ile LeuHis Thr Pro Gly Cys Val Pro Cys Val Gln Asp Gly Asn Thr 85 90 95 Ser AlaCys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr Val 100 105 110 GlyAla Thr Thr Ala Ser Ile Arg Arg His Val Asp Ile Leu Val Gly 115 120 125Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly Ala 130 135140 Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg Pro Arg Arg His Gln 145150 155 160 Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu SerGly 165 170 175 His Arg Met Ala 180 (2) INFORMATION FOR SEQ ID NO: 17 :(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 541 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATESOURCE: (B) CLONE: HD10-2-21 (ix) FEATURE: (A) NAME/KEY: CDS (B)LOCATION: 2..541 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: C GTC GGC GCTCCT GTA GGA GGC GTC GCA AGA GCC CTT GCG CAT GGC 46 Val Gly Ala Pro ValGly Gly Val Ala Arg Ala Leu Ala His Gly 1 5 10 15 GTG AGG GCC CTT GAAGAC GGG ATA AAT TTC GCA ACA GGG AAT TTG CCC 94 Val Arg Ala Leu Glu AspGly Ile Asn Phe Ala Thr Gly Asn Leu Pro 20 25 30 GGT TGC TCC TTT TCT ATCTTC CTT CTT GCT CTG TTC TCT TGC TTA ATC 142 Gly Cys Ser Phe Ser Ile PheLeu Leu Ala Leu Phe Ser Cys Leu Ile 35 40 45 CAT CCA GCA GCT AGT CTA GAGTGG CGG AAC ACG TCT GGC CTC TAC GTC 190 His Pro Ala Ala Ser Leu Glu TrpArg Asn Thr Ser Gly Leu Tyr Val 50 55 60 CTT ACC AAC GAC TGT TCC AAT AGCAGT ATT GTG TAT GAG GCC GAT GAC 238 Leu Thr Asn Asp Cys Ser Asn Ser SerIle Val Tyr Glu Ala Asp Asp 65 70 75 GTT ATT CTG CAC ACA CCC GGC TGT GTACCT TGT GTT CAG GAC GGT AAT 286 Val Ile Leu His Thr Pro Gly Cys Val ProCys Val Gln Asp Gly Asn 80 85 90 95 ACA TCT GCG TGC TGG ACC CCA GTG ACACCT ACA GTG GCA GTC AGG TAC 334 Thr Ser Ala Cys Trp Thr Pro Val Thr ProThr Val Ala Val Arg Tyr 100 105 110 GTC GGA GCA ACC ACC GCT TCG ATA CGCAGG CAT GTA GAC ATA TTG GTG 382 Val Gly Ala Thr Thr Ala Ser Ile Arg ArgHis Val Asp Ile Leu Val 115 120 125 GGC GCG GCC ACG ATG TGC TCT GCT CTCTAC GTG GGT GAT ATG TGT GGG 430 Gly Ala Ala Thr Met Cys Ser Ala Leu TyrVal Gly Asp Met Cys Gly 130 135 140 GCC GTC TTC CTC GTG GGA CAA GCC TTCACG TTC AGA CCT CGT CGC CAT 478 Ala Val Phe Leu Val Gly Gln Ala Phe ThrPhe Arg Pro Arg Arg His 145 150 155 CAA ACG GTC CAG ACC TGT AAC TGC TCACTG TAC CCA GGC CAT CTT TCA 526 Gln Thr Val Gln Thr Cys Asn Cys Ser LeuTyr Pro Gly His Leu Ser 160 165 170 175 GGA CAC CGA ATG GCT 541 Gly HisArg Met Ala 180 (2) INFORMATION FOR SEQ ID NO: 18: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 180 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 18: Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala HisGly Val 1 5 10 15 Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr Gly AsnLeu Pro Gly 20 25 30 Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Phe Ser CysLeu Ile His 35 40 45 Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser Gly LeuTyr Val Leu 50 55 60 Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu AlaAsp Asp Val 65 70 75 80 Ile Leu His Thr Pro Gly Cys Val Pro Cys Val GlnAsp Gly Asn Thr 85 90 95 Ser Ala Cys Trp Thr Pro Val Thr Pro Thr Val AlaVal Arg Tyr Val 100 105 110 Gly Ala Thr Thr Ala Ser Ile Arg Arg His ValAsp Ile Leu Val Gly 115 120 125 Ala Ala Thr Met Cys Ser Ala Leu Tyr ValGly Asp Met Cys Gly Ala 130 135 140 Val Phe Leu Val Gly Gln Ala Phe ThrPhe Arg Pro Arg Arg His Gln 145 150 155 160 Thr Val Gln Thr Cys Asn CysSer Leu Tyr Pro Gly His Leu Ser Gly 165 170 175 His Arg Met Ala 180 (2)INFORMATION FOR SEQ ID NO: 19: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:541 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii)ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE: BR36-9-13 (ix)FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..541 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 19: C GTC GGC GCT CCC GTA GGA GGC GTC GCA AGAGCC CTT GCG CAT GGC 46 Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala LeuAla His Gly 1 5 10 15 GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACAGGG AAT TTG CCC 94 Val Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr GlyAsn Leu Pro 20 25 30 GGT TGC TCC TTT TCT ATT TTC CTT CTT GCT CTG TTC TCTTGC TTA ATT 142 Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Phe Ser CysLeu Ile 35 40 45 CAT CCA GCA GCT AGT CTA GAG TGG CGG AAT ACG TCT GGC CTCTAT GTC 190 His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser Gly Leu TyrVal 50 55 60 CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAC GAG GCC GATGAC 238 Leu Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala Asp Asp65 70 75 GTT ATT CTG CAC ACA CCC GGC TGC ATA CCT TGT GTC CAG GAC GGC AAT286 Val Ile Leu His Thr Pro Gly Cys Ile Pro Cys Val Gln Asp Gly Asn 8085 90 95 ACA TCC ACG TGC TGG ACC CCA GTG ACA CCT ACA GTG GCA GTC AAG TAC334 Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Lys Tyr 100105 110 GTC GGA GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG GAC CTA TTA GTG382 Val Gly Ala Thr Thr Ala Ser Ile Arg Ser His Val Asp Leu Leu Val 115120 125 GGC GCG GCC ACG ATG TGC TCA GCG CTC TAC GTG GGT GAT ATG TGT GGG430 Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly 130135 140 GCC GTC TTC CTT GTG GGA CAA GCC TTC ACG TTC AGA CCT CGT CGC CAT478 Ala Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg Pro Arg Arg His 145150 155 CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA GGC CAT CTT TCA526 Gln Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser 160165 170 175 GGA CAT CGA ATG GCT 541 Gly His Arg Met Ala 180 (2)INFORMATION FOR SEQ ID NO: 20: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:180 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: Val Gly Ala ProVal Gly Gly Val Ala Arg Ala Leu Ala His Gly Val 1 5 10 15 Arg Ala LeuGlu Asp Gly Ile Asn Phe Ala Thr Gly Asn Leu Pro Gly 20 25 30 Cys Ser PheSer Ile Phe Leu Leu Ala Leu Phe Ser Cys Leu Ile His 35 40 45 Pro Ala AlaSer Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val Leu 50 55 60 Thr Asn AspCys Ser Asn Ser Ser Ile Val Tyr Glu Ala Asp Asp Val 65 70 75 80 Ile LeuHis Thr Pro Gly Cys Ile Pro Cys Val Gln Asp Gly Asn Thr 85 90 95 Ser ThrCys Trp Thr Pro Val Thr Pro Thr Val Ala Val Lys Tyr Val 100 105 110 GlyAla Thr Thr Ala Ser Ile Arg Ser His Val Asp Leu Leu Val Gly 115 120 125Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly Ala 130 135140 Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg Pro Arg Arg His Gln 145150 155 160 Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu SerGly 165 170 175 His Arg Met Ala 180 (2) INFORMATION FOR SEQ ID NO: 21:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 541 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATESOURCE: (B) CLONE: BR36-9-20 (ix) FEATURE: (A) NAME/KEY: CDS (B)LOCATION: 2..541 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: C GTC GGC GCTCCC GTA GGA GGC GTC GCA AGA GCC CTT GCG CAT GGC 46 Val Gly Ala Pro ValGly Gly Val Ala Arg Ala Leu Ala His Gly 1 5 10 15 GTG AGG GCC CTT GAAGAC GGG ATA AAT TTC GCA ACA GGG AAT TTG CCC 94 Val Arg Ala Leu Glu AspGly Ile Asn Phe Ala Thr Gly Asn Leu Pro 20 25 30 GGT TGC TCC TTT TCT ATTTTC CTT CTT GCT CTG TTC TCT TGC TTA ATT 142 Gly Cys Ser Phe Ser Ile PheLeu Leu Ala Leu Phe Ser Cys Leu Ile 35 40 45 CAT CCA GCA GCT AGT CTA GAGTGG CGG AAT ACG TCT GGC CTC TAT GTC 190 His Pro Ala Ala Ser Leu Glu TrpArg Asn Thr Ser Gly Leu Tyr Val 50 55 60 CTT ACC AAC GAC TGT TCC AAT AGCAGT ATT GTG TAC GAG GCC GAT GAC 238 Leu Thr Asn Asp Cys Ser Asn Ser SerIle Val Tyr Glu Ala Asp Asp 65 70 75 GTT ATT CTG CAC ACA CCC GGC TGC ATACCT TGT GTC CAG GAC GGC AAT 286 Val Ile Leu His Thr Pro Gly Cys Ile ProCys Val Gln Asp Gly Asn 80 85 90 95 ACA TCC ACG TGC TGG ACC CCA GTG ACACCT ACA GTG GCA GTC AAG TAC 334 Thr Ser Thr Cys Trp Thr Pro Val Thr ProThr Val Ala Val Lys Tyr 100 105 110 GTC GGA GCA ACC ACC GCT TCG ATA CGCAGT CAT GTG GAC CTA TTA GTG 382 Val Gly Ala Thr Thr Ala Ser Ile Arg SerHis Val Asp Leu Leu Val 115 120 125 GGC GCG GCC ACG ATG TGC TCT GCG CTCTAC GTG GGT GAC ATG TGT GGG 430 Gly Ala Ala Thr Met Cys Ser Ala Leu TyrVal Gly Asp Met Cys Gly 130 135 140 GCT GTC TTC CTC GTG GGA CAA GCC TTCACG TTC AGA CCT CGT CGC CAT 478 Ala Val Phe Leu Val Gly Gln Ala Phe ThrPhe Arg Pro Arg Arg His 145 150 155 CAA ACG GTC CAG ACC TGT AAC TGC TCGCTG TAC CCA GGC CAT CTT TCA 526 Gln Thr Val Gln Thr Cys Asn Cys Ser LeuTyr Pro Gly His Leu Ser 160 165 170 175 GGA CAT CGA ATG GCT 541 Gly HisArg Met Ala 180 (2) INFORMATION FOR SEQ ID NO: 22: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 180 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 22: Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala HisGly Val 1 5 10 15 Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr Gly AsnLeu Pro Gly 20 25 30 Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Phe Ser CysLeu Ile His 35 40 45 Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser Gly LeuTyr Val Leu 50 55 60 Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu AlaAsp Asp Val 65 70 75 80 Ile Leu His Thr Pro Gly Cys Ile Pro Cys Val GlnAsp Gly Asn Thr 85 90 95 Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val AlaVal Lys Tyr Val 100 105 110 Gly Ala Thr Thr Ala Ser Ile Arg Ser His ValAsp Leu Leu Val Gly 115 120 125 Ala Ala Thr Met Cys Ser Ala Leu Tyr ValGly Asp Met Cys Gly Ala 130 135 140 Val Phe Leu Val Gly Gln Ala Phe ThrPhe Arg Pro Arg Arg His Gln 145 150 155 160 Thr Val Gln Thr Cys Asn CysSer Leu Tyr Pro Gly His Leu Ser Gly 165 170 175 His Arg Met Ala 180 (2)INFORMATION FOR SEQ ID NO: 23: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:541 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii)ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE: BR33-1-10 (ix)FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..541 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 23: C GTC GGC GCT CCC GTA GGA GGC GTC GCA AGAGCC CTT GCG CAT GGC 46 Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala LeuAla His Gly 1 5 10 15 GTG AGG GCC CTT GAG GAC GGG ATA AAC TTC GCA ACAGGG AAT TTG CCC 94 Val Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr GlyAsn Leu Pro 20 25 30 GGT TGC TCC TTT TCT ATC TTC CTT CTT GCT CTG TTC TCTTGC TTA ATC 142 Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Phe Ser CysLeu Ile 35 40 45 CAT CCA GCA GCT GGT CTA GAG TGG CGG AAT ACG TCT GGC CTCTAT GTC 190 His Pro Ala Ala Gly Leu Glu Trp Arg Asn Thr Ser Gly Leu TyrVal 50 55 60 CTT ACC AAC GAC TGT TCC AAT AGT AGT ATT GTG TAT GAG GCC GATGAC 238 Leu Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala Asp Asp65 70 75 GTT ATT CTG CAC GCG CCC GGC TGT GTA CCT TGT GTC CAG GAC GGC AAT286 Val Ile Leu His Ala Pro Gly Cys Val Pro Cys Val Gln Asp Gly Asn 8085 90 95 ACG TCT ACA TGC TGG ACC CCA GTA ACA CCT ACA GTG GCA GTC AGG TAC334 Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr 100105 110 GTC GGG GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG GAC CTG TTA GTA382 Val Gly Ala Thr Thr Ala Ser Ile Arg Ser His Val Asp Leu Leu Val 115120 125 GGC GCG GCC ACG ATG TGC TCT GCG CTT TAC GTG GGT GAT ATG TGT GGG430 Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly 130135 140 GCC GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA CCC CGC CGC CAT478 Ala Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg Pro Arg Arg His 145150 155 CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA GGC CAT CTT TCA526 Gln Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser 160165 170 175 GGA CAT CGC ATG GCT 541 Gly His Arg Met Ala 180 (2)INFORMATION FOR SEQ ID NO: 24: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:180 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: Val Gly Ala ProVal Gly Gly Val Ala Arg Ala Leu Ala His Gly Val 1 5 10 15 Arg Ala LeuGlu Asp Gly Ile Asn Phe Ala Thr Gly Asn Leu Pro Gly 20 25 30 Cys Ser PheSer Ile Phe Leu Leu Ala Leu Phe Ser Cys Leu Ile His 35 40 45 Pro Ala AlaGly Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val Leu 50 55 60 Thr Asn AspCys Ser Asn Ser Ser Ile Val Tyr Glu Ala Asp Asp Val 65 70 75 80 Ile LeuHis Ala Pro Gly Cys Val Pro Cys Val Gln Asp Gly Asn Thr 85 90 95 Ser ThrCys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr Val 100 105 110 GlyAla Thr Thr Ala Ser Ile Arg Ser His Val Asp Leu Leu Val Gly 115 120 125Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly Ala 130 135140 Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg Pro Arg Arg His Gln 145150 155 160 Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu SerGly 165 170 175 His Arg Met Ala 180 (2) INFORMATION FOR SEQ ID NO: 25:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 541 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATESOURCE: (B) CLONE: BR33-1-19 (ix) FEATURE: (A) NAME/KEY: CDS (B)LOCATION: 2..541 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: C GTC GGC GCTCCC GTA GGA GGC GTC GCA AGA GCC CTT GCG CAT GGC 46 Val Gly Ala Pro ValGly Gly Val Ala Arg Ala Leu Ala His Gly 1 5 10 15 GTG AGG GCC CTT GAGGAC GGG ATA AAC TTC GCA ACA GGG AAT TTG CCC 94 Val Arg Ala Leu Glu AspGly Ile Asn Phe Ala Thr Gly Asn Leu Pro 20 25 30 GGT TGC TCT TTT TCT ATCTTC CTT CTT GCT CTG TTC TCT TGC TTA ATC 142 Gly Cys Ser Phe Ser Ile PheLeu Leu Ala Leu Phe Ser Cys Leu Ile 35 40 45 CAT CCA GCA GCT GGT CTA GAGTGG CGG AAT ACG TCT GGC CTC TAT GTC 190 His Pro Ala Ala Gly Leu Glu TrpArg Asn Thr Ser Gly Leu Tyr Val 50 55 60 CTT ACC AAC GAC TGT TCC AAT AGTAGT ATT GTG TAT GAG GCC GAT GAC 238 Leu Thr Asn Asp Cys Ser Asn Ser SerIle Val Tyr Glu Ala Asp Asp 65 70 75 GTT ATT CTG CAC GCG CCC GGC TGT GTACCT TGT GTC CAG GAC GGC AAT 286 Val Ile Leu His Ala Pro Gly Cys Val ProCys Val Gln Asp Gly Asn 80 85 90 95 ACG TCT ACA TGC TGG ACC CCA GTA ACACCT ACA GTG GCA GTC AGG TAC 334 Thr Ser Thr Cys Trp Thr Pro Val Thr ProThr Val Ala Val Arg Tyr 100 105 110 GTC GGG GCA ACC ACC GCT TCG ATA CGCAGT CAT GTG GAC CTG TTA GTA 382 Val Gly Ala Thr Thr Ala Ser Ile Arg SerHis Val Asp Leu Leu Val 115 120 125 GGC GCG GCC ACG ATG TGC TCT GCG CTTTAC GTG GGT GAT ATG TGT GGG 430 Gly Ala Ala Thr Met Cys Ser Ala Leu TyrVal Gly Asp Met Cys Gly 130 135 140 GCC GTC TTC CTC GTG GGA CAA GCC TTCACG TTC AGA CCC CGC CGC CAT 478 Ala Val Phe Leu Val Gly Gln Ala Phe ThrPhe Arg Pro Arg Arg His 145 150 155 CAA ACG GTC CAG ACC TGT AAC TGC TCGCTG TAC CCA GGC CAT CTT TCA 526 Gln Thr Val Gln Thr Cys Asn Cys Ser LeuTyr Pro Gly His Leu Ser 160 165 170 175 GGA CAT CGA ATG GCT 541 Gly HisArg Met Ala 180 (2) INFORMATION FOR SEQ ID NO: 26: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 180 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 26: Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala Leu Ala HisGly Val 1 5 10 15 Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr Gly AsnLeu Pro Gly 20 25 30 Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Phe Ser CysLeu Ile His 35 40 45 Pro Ala Ala Gly Leu Glu Trp Arg Asn Thr Ser Gly LeuTyr Val Leu 50 55 60 Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu AlaAsp Asp Val 65 70 75 80 Ile Leu His Ala Pro Gly Cys Val Pro Cys Val GlnAsp Gly Asn Thr 85 90 95 Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val AlaVal Arg Tyr Val 100 105 110 Gly Ala Thr Thr Ala Ser Ile Arg Ser His ValAsp Leu Leu Val Gly 115 120 125 Ala Ala Thr Met Cys Ser Ala Leu Tyr ValGly Asp Met Cys Gly Ala 130 135 140 Val Phe Leu Val Gly Gln Ala Phe ThrPhe Arg Pro Arg Arg His Gln 145 150 155 160 Thr Val Gln Thr Cys Asn CysSer Leu Tyr Pro Gly His Leu Ser Gly 165 170 175 His Arg Met Ala 180 (2)INFORMATION FOR SEQ ID NO: 27: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:541 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii)ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE: BR33-1-20 (ix)FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..541 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 27: C GTC GGC GCT CCC GTA GGA GGC GTC GCA AGAGCC CTT GCG CAT GGC 46 Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala LeuAla His Gly 1 5 10 15 GTG AGG GCC CTT GAG GAC GGG ATA AAC TTC GCA ACAGGG AAT TTG CCC 94 Val Arg Ala Leu Glu Asp Gly Ile Asn Phe Ala Thr GlyAsn Leu Pro 20 25 30 GGT TGC TCT TTT TCT ATC TTC CTT CTT GCT CTG TTC TCTTGC TTA ATC 142 Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala Leu Phe Ser CysLeu Ile 35 40 45 CAT CCA GCA GCT GGT CTA GAG TGG CGG AAT ACG TCT GGC CTCTAT GTC 190 His Pro Ala Ala Gly Leu Glu Trp Arg Asn Thr Ser Gly Leu TyrVal 50 55 60 CTT ACC AAC GAC TGT TCC AAT AGT AGT ATT GTG TAT GAG GCC GATGAC 238 Leu Thr Asn Asp Cys Ser Asn Ser Ser Ile Val Tyr Glu Ala Asp Asp65 70 75 GTT ATT CTG CAC GCG CCC GGC TGT GTA CCT TGT GTC CAG GAC GGC AAT286 Val Ile Leu His Ala Pro Gly Cys Val Pro Cys Val Gln Asp Gly Asn 8085 90 95 ACG TCT ACA TGC TGG ACC CCA GTA ACA CCT ACA GTG GCA GTC AGG TAC334 Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr 100105 110 GTC GGG GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG GAC CTG TTA GTA382 Val Gly Ala Thr Thr Ala Ser Ile Arg Ser His Val Asp Leu Leu Val 115120 125 GGC GCG GCC ACG ATG TGC TCT GCG CTT TAC GTG GGT GAT ATG TGT GGG430 Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly 130135 140 GCC GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA CCC CGC CGC CAT478 Ala Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg Pro Arg Arg His 145150 155 CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA GGC CAT CTT TCA526 Gln Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu Ser 160165 170 175 GGA CAT CGA ATG GCT 541 Gly His Arg Met Ala 180 (2)INFORMATION FOR SEQ ID NO: 28: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:180 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: Val Gly Ala ProVal Gly Gly Val Ala Arg Ala Leu Ala His Gly Val 1 5 10 15 Arg Ala LeuGlu Asp Gly Ile Asn Phe Ala Thr Gly Asn Leu Pro Gly 20 25 30 Cys Ser PheSer Ile Phe Leu Leu Ala Leu Phe Ser Cys Leu Ile His 35 40 45 Pro Ala AlaGly Leu Glu Trp Arg Asn Thr Ser Gly Leu Tyr Val Leu 50 55 60 Thr Asn AspCys Ser Asn Ser Ser Ile Val Tyr Glu Ala Asp Asp Val 65 70 75 80 Ile LeuHis Ala Pro Gly Cys Val Pro Cys Val Gln Asp Gly Asn Thr 85 90 95 Ser ThrCys Trp Thr Pro Val Thr Pro Thr Val Ala Val Arg Tyr Val 100 105 110 GlyAla Thr Thr Ala Ser Ile Arg Ser His Val Asp Leu Leu Val Gly 115 120 125Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly Asp Met Cys Gly Ala 130 135140 Val Phe Leu Val Gly Gln Ala Phe Thr Phe Arg Pro Arg Arg His Gln 145150 155 160 Thr Val Gln Thr Cys Asn Cys Ser Leu Tyr Pro Gly His Leu SerGly 165 170 175 His Arg Met Ala 180 (2) INFORMATION FOR SEQ ID NO: 29:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 287 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATESOURCE: (B) CLONE: HCCl153 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION:3..287 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: TA GAC TTT TGG GAG AGCGTC TTC ACT GGA CTA ACT CAC ATA GAT GCC 47 Asp Phe Trp Glu Ser Val PheThr Gly Leu Thr His Ile Asp Ala 1 5 10 15 CAC TTT CTG TCA CAG ACT AAGCAG CAG GGA CTC AAC TTC TCG TTC CTG 95 His Phe Leu Ser Gln Thr Lys GlnGln Gly Leu Asn Phe Ser Phe Leu 20 25 30 ACT GCC TAC CAA GCC ACT GTG TGCGCT CGC GCG CAG GCT CCT CCC CCA 143 Thr Ala Tyr Gln Ala Thr Val Cys AlaArg Ala Gln Ala Pro Pro Pro 35 40 45 AGT TGG GAC GAG ATG TGG AAG TGT CTCGTA CGG CTT AAG CCA ACA CTA 191 Ser Trp Asp Glu Met Trp Lys Cys Leu ValArg Leu Lys Pro Thr Leu 50 55 60 CAT GGA CCT ACG CCT CTT CTA TAT CGG TTGGGG CCT GTC CAA AAT GAA 239 His Gly Pro Thr Pro Leu Leu Tyr Arg Leu GlyPro Val Gln Asn Glu 65 70 75 ATC TGC TTG ACA CAC CCC ATC ACA AAA TAC ATCATG GCA TGC ATG TCA 287 Ile Cys Leu Thr His Pro Ile Thr Lys Tyr Ile MetAla Cys Met Ser 80 85 90 95 (2) INFORMATION FOR SEQ ID NO: 30: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 95 amino acids (B) TYPE: aminoacid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 30: Asp Phe Trp Glu Ser Val Phe Thr Gly Leu ThrHis Ile Asp Ala His 1 5 10 15 Phe Leu Ser Gln Thr Lys Gln Gln Gly LeuAsn Phe Ser Phe Leu Thr 20 25 30 Ala Tyr Gln Ala Thr Val Cys Ala Arg AlaGln Ala Pro Pro Pro Ser 35 40 45 Trp Asp Glu Met Trp Lys Cys Leu Val ArgLeu Lys Pro Thr Leu His 50 55 60 Gly Pro Thr Pro Leu Leu Tyr Arg Leu GlyPro Val Gln Asn Glu Ile 65 70 75 80 Cys Leu Thr His Pro Ile Thr Lys TyrIle Met Ala Cys Met Ser 85 90 95 (2) INFORMATION FOR SEQ ID NO: 31: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 401 base pairs (B) TYPE: nucleicacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE:(B) CLONE: HD10-1-25 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION:3..401 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: TC CAA AAT GAA ATC TGCTTG ACA CAC CCC GTC ACA AAA TAC ATT ATG 47 Gln Asn Glu Ile Cys Leu ThrHis Pro Val Thr Lys Tyr Ile Met 1 5 10 15 GCA TGC ATG TCA GCT GAT CTGGAA GTA ACC ACC AGC ACC TGG GTG TTG 95 Ala Cys Met Ser Ala Asp Leu GluVal Thr Thr Ser Thr Trp Val Leu 20 25 30 CTT GGA GGG GTC CTC GCG GCC CTAGCG GCC TAC TGC TTG TCA GTC GGC 143 Leu Gly Gly Val Leu Ala Ala Leu AlaAla Tyr Cys Leu Ser Val Gly 35 40 45 TGC GTT GTA ATC GTG GGT CAT ATC GAGCTG GGG GGC AAG CCG GCA CTC 191 Cys Val Val Ile Val Gly His Ile Glu LeuGly Gly Lys Pro Ala Leu 50 55 60 GTT CCA GAC AAG GAG GTG TTG TAT CAA CAGTAC GAT GAG ATG GAG GAG 239 Val Pro Asp Lys Glu Val Leu Tyr Gln Gln TyrAsp Glu Met Glu Glu 65 70 75 TGC TCG CAA GCC GCC CCA TAC ATC GAA CAA GCTCAG GTA ATA GCC CAC 287 Cys Ser Gln Ala Ala Pro Tyr Ile Glu Gln Ala GlnVal Ile Ala His 80 85 90 95 CAG TTC AAG GAG AAA ATC CTT GGA CTG CTG CAGCGA GCC ACC CAA CAA 335 Gln Phe Lys Glu Lys Ile Leu Gly Leu Leu Gln ArgAla Thr Gln Gln 100 105 110 CAA GCT GTC ATT GAG CCC GTA ATA GCT TCC AACTGG CAA AAG CTT GAA 383 Gln Ala Val Ile Glu Pro Val Ile Ala Ser Asn TrpGln Lys Leu Glu 115 120 125 ACC TTC TGG CAC AAG CAT 401 Thr Phe Trp HisLys His 130 (2) INFORMATION FOR SEQ ID NO: 32: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 133 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 32: Gln Asn Glu Ile Cys Leu Thr His Pro Val Thr Lys Tyr IleMet Ala 1 5 10 15 Cys Met Ser Ala Asp Leu Glu Val Thr Thr Ser Thr TrpVal Leu Leu 20 25 30 Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu SerVal Gly Cys 35 40 45 Val Val Ile Val Gly His Ile Glu Leu Gly Gly Lys ProAla Leu Val 50 55 60 Pro Asp Lys Glu Val Leu Tyr Gln Gln Tyr Asp Glu MetGlu Glu Cys 65 70 75 80 Ser Gln Ala Ala Pro Tyr Ile Glu Gln Ala Gln ValIle Ala His Gln 85 90 95 Phe Lys Glu Lys Ile Leu Gly Leu Leu Gln Arg AlaThr Gln Gln Gln 100 105 110 Ala Val Ile Glu Pro Val Ile Ala Ser Asn TrpGln Lys Leu Glu Thr 115 120 125 Phe Trp His Lys His 130 (2) INFORMATIONFOR SEQ ID NO: 33: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 401 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE:NO (vii) IMMEDIATE SOURCE: (B) CLONE: HD10-1-3 (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 3..401 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: TC CAA AAT GAA ATC TGC TTG ACA CAC CCC GTC ACA AAA TAC ATT ATG 47Gln Asn Glu Ile Cys Leu Thr His Pro Val Thr Lys Tyr Ile Met 1 5 10 15GCA TGC ATG TCA GCT GAT CTG GAA GTA ACC ACC AGC ACC TGG GTG TTG 95 AlaCys Met Ser Ala Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu 20 25 30 CTTGGA GGG GTC CTC GCG GCC CTA GCG GCC TAC TGC TTG TCA GTC GGC 143 Leu GlyGly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly 35 40 45 TGC GTTGTA ATC GTG GGT CAT ATC GAG CTG GGG GGC AAG CCG GCA CTC 191 Cys Val ValIle Val Gly His Ile Glu Leu Gly Gly Lys Pro Ala Leu 50 55 60 GTT CCA GACAAG GAG GTG TTG TAT CAA CAG TAC GAT GAG ATG GAG GAG 239 Val Pro Asp LysGlu Val Leu Tyr Gln Gln Tyr Asp Glu Met Glu Glu 65 70 75 TGC TCG CAA GCCGCC CCA TAC ATC GAA CAA GCT CAG GTA ATA GCC CAC 287 Cys Ser Gln Ala AlaPro Tyr Ile Glu Gln Ala Gln Val Ile Ala His 80 85 90 95 CAG TTC AAG GAGAAA ATC CTT GGA CTG CTG CAG CGA GCC ACC CAA CAA 335 Gln Phe Lys Glu LysIle Leu Gly Leu Leu Gln Arg Ala Thr Gln Gln 100 105 110 CAA GCT GTC ATTGAG CCC GTA ATA GCT TCC AAC TGG CAA AAG CTT GAA 383 Gln Ala Val Ile GluPro Val Ile Ala Ser Asn Trp Gln Lys Leu Glu 115 120 125 ACC TTC TGG CACAAG CAT 401 Thr Phe Trp His Lys His 130 (2) INFORMATION FOR SEQ ID NO:34: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 133 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 34: Gln Asn Glu Ile Cys Leu Thr His ProVal Thr Lys Tyr Ile Met Ala 1 5 10 15 Cys Met Ser Ala Asp Leu Glu ValThr Thr Ser Thr Trp Val Leu Leu 20 25 30 Gly Gly Val Leu Ala Ala Leu AlaAla Tyr Cys Leu Ser Val Gly Cys 35 40 45 Val Val Ile Val Gly His Ile GluLeu Gly Gly Lys Pro Ala Leu Val 50 55 60 Pro Asp Lys Glu Val Leu Tyr GlnGln Tyr Asp Glu Met Glu Glu Cys 65 70 75 80 Ser Gln Ala Ala Pro Tyr IleGlu Gln Ala Gln Val Ile Ala His Gln 85 90 95 Phe Lys Glu Lys Ile Leu GlyLeu Leu Gln Arg Ala Thr Gln Gln Gln 100 105 110 Ala Val Ile Glu Pro ValIle Ala Ser Asn Trp Gln Lys Leu Glu Thr 115 120 125 Phe Trp His Lys His130 (2) INFORMATION FOR SEQ ID NO: 35: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 401 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO(iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE: BR36-20-164 (ix)FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 3..401 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 35: TC CAA AAT GAA ATC TGC TTG ACA CAC CCC ATCACA AAA TAC ATC ATG 47 Gln Asn Glu Ile Cys Leu Thr His Pro Ile Thr LysTyr Ile Met 1 5 10 15 GCA TGC ATG TCA GCT GAT CTG GAA GTA ACC ACC AGCACC TGG GTT TTG 95 Ala Cys Met Ser Ala Asp Leu Glu Val Thr Thr Ser ThrTrp Val Leu 20 25 30 CTT GGA GGG GTC CTC GCG GCC CTA GCG GCC TAC TGC TTGTCA GTC GGT 143 Leu Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu SerVal Gly 35 40 45 TGT GTT GTG ATT GTG GGT CAT ATC GAG CTG GGG GGC AAG CCGGCA ATC 191 Cys Val Val Ile Val Gly His Ile Glu Leu Gly Gly Lys Pro AlaIle 50 55 60 GTT CCA GAC AAA GAG GTG TTG TAT CAA CAA TAC GAT GAG ATG GAAGAG 239 Val Pro Asp Lys Glu Val Leu Tyr Gln Gln Tyr Asp Glu Met Glu Glu65 70 75 TGC TCA CAA GCT GCC CCA TAT ATC GAA CAA GCT CAG GTA ATA GCT CAC287 Cys Ser Gln Ala Ala Pro Tyr Ile Glu Gln Ala Gln Val Ile Ala His 8085 90 95 CAG TTC AAG GGA AAA GTC CTT GGA TTG CTG CAG CGA GCC ACC CAA CAA335 Gln Phe Lys Gly Lys Val Leu Gly Leu Leu Gln Arg Ala Thr Gln Gln 100105 110 CAA GCT GTC ATT GAG CCC ATA GTA ACT ACC AAC TGG CAA AAG CTT GAG383 Gln Ala Val Ile Glu Pro Ile Val Thr Thr Asn Trp Gln Lys Leu Glu 115120 125 GCC TTT TGG CAC AAG CAT 401 Ala Phe Trp His Lys His 130 (2)INFORMATION FOR SEQ ID NO: 36: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:133 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: Gln Asn Glu IleCys Leu Thr His Pro Ile Thr Lys Tyr Ile Met Ala 1 5 10 15 Cys Met SerAla Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu Leu 20 25 30 Gly Gly ValLeu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly Cys 35 40 45 Val Val IleVal Gly His Ile Glu Leu Gly Gly Lys Pro Ala Ile Val 50 55 60 Pro Asp LysGlu Val Leu Tyr Gln Gln Tyr Asp Glu Met Glu Glu Cys 65 70 75 80 Ser GlnAla Ala Pro Tyr Ile Glu Gln Ala Gln Val Ile Ala His Gln 85 90 95 Phe LysGly Lys Val Leu Gly Leu Leu Gln Arg Ala Thr Gln Gln Gln 100 105 110 AlaVal Ile Glu Pro Ile Val Thr Thr Asn Trp Gln Lys Leu Glu Ala 115 120 125Phe Trp His Lys His 130 (2) INFORMATION FOR SEQ ID NO: 37: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 401 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:BR36-20-166 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 3..401 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 37: TC CAA AAT GAA ATC TGC TTG ACA CACCCC ATC ACA AAA TAC ATC ATG 47 Gln Asn Glu Ile Cys Leu Thr His Pro IleThr Lys Tyr Ile Met 1 5 10 15 GCA TGC ATG TCA GCT GAT CTG GAA GTA ACCACC AGC ACC TGG GTT TTG 95 Ala Cys Met Ser Ala Asp Leu Glu Val Thr ThrSer Thr Trp Val Leu 20 25 30 CTT GGA GGG GTC CTC GCG GCC CTA GCG GCC TACTGC TTG TCA GTC GGT 143 Leu Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr CysLeu Ser Val Gly 35 40 45 TGT GTT GTG ATT GTG GGT CAT ATC GAG CTG GGG GGCAAG CCG GCA ATC 191 Cys Val Val Ile Val Gly His Ile Glu Leu Gly Gly LysPro Ala Ile 50 55 60 GTT CCA GAC AAA GAG GTG TTG TAT CAA CAA TAC GAT GAGATG GAA GAG 239 Val Pro Asp Lys Glu Val Leu Tyr Gln Gln Tyr Asp Glu MetGlu Glu 65 70 75 TGC TCA CAA GCT GCC CCA TAT ATC GAA CAA GCT CAG GTG ATAGCT CAC 287 Cys Ser Gln Ala Ala Pro Tyr Ile Glu Gln Ala Gln Val Ile AlaHis 80 85 90 95 CAG TTC AAG GAA AAA GTC CTT GGA TTG CTG CAG CGA GCC ACCCAA CAA 335 Gln Phe Lys Glu Lys Val Leu Gly Leu Leu Gln Arg Ala Thr GlnGln 100 105 110 CAA GCT GTC ATT GAG CCC ATA GTA ACT ACC AAC TGG CAA AAGCTT GAG 383 Gln Ala Val Ile Glu Pro Ile Val Thr Thr Asn Trp Gln Lys LeuGlu 115 120 125 GCC TTT TGG CAC AAG CAT 401 Ala Phe Trp His Lys His 130(2) INFORMATION FOR SEQ ID NO: 38: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 133 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: Gln AsnGlu Ile Cys Leu Thr His Pro Ile Thr Lys Tyr Ile Met Ala 1 5 10 15 CysMet Ser Ala Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu Leu 20 25 30 GlyGly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly Cys 35 40 45 ValVal Ile Val Gly His Ile Glu Leu Gly Gly Lys Pro Ala Ile Val 50 55 60 ProAsp Lys Glu Val Leu Tyr Gln Gln Tyr Asp Glu Met Glu Glu Cys 65 70 75 80Ser Gln Ala Ala Pro Tyr Ile Glu Gln Ala Gln Val Ile Ala His Gln 85 90 95Phe Lys Glu Lys Val Leu Gly Leu Leu Gln Arg Ala Thr Gln Gln Gln 100 105110 Ala Val Ile Glu Pro Ile Val Thr Thr Asn Trp Gln Lys Leu Glu Ala 115120 125 Phe Trp His Lys His 130 (2) INFORMATION FOR SEQ ID NO: 39: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 401 base pairs (B) TYPE: nucleicacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE:(B) CLONE: BR36-20-165 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION:3..401 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: TC CAA AAT GAA ATC TGCTTG ACA CAC CCC ATC ACA AAA TAC ATC ATG 47 Gln Asn Glu Ile Cys Leu ThrHis Pro Ile Thr Lys Tyr Ile Met 1 5 10 15 GCA TGC ATG TCA GCT GAT CTGGAA GTA ACC ACC AGC ACC TGG GTT TTG 95 Ala Cys Met Ser Ala Asp Leu GluVal Thr Thr Ser Thr Trp Val Leu 20 25 30 CTT GGA GGG GTC CTC GCG GCC CTAGCG GCC TAC TGC TTG TCA GTC GGT 143 Leu Gly Gly Val Leu Ala Ala Leu AlaAla Tyr Cys Leu Ser Val Gly 35 40 45 TGT GTT GTG ATT GTG GGT CAT ATC GAGCTG GGG GGC AAG CCG GCA ATC 191 Cys Val Val Ile Val Gly His Ile Glu LeuGly Gly Lys Pro Ala Ile 50 55 60 GTT CCA GAC AAA GAG GTG TTG TAT CAA CAATAC GAT GAG ATG GAA GAG 239 Val Pro Asp Lys Glu Val Leu Tyr Gln Gln TyrAsp Glu Met Glu Glu 65 70 75 TGC TCA CAA GCT GCC CCA TAT ATC GAA CAA GCTCAG GTA ATA GCT CAC 287 Cys Ser Gln Ala Ala Pro Tyr Ile Glu Gln Ala GlnVal Ile Ala His 80 85 90 95 CAG TTC AAG GAA AAA GTC CTT GGA TTG CTG CAGCGA GCC ACC CAA CAA 335 Gln Phe Lys Glu Lys Val Leu Gly Leu Leu Gln ArgAla Thr Gln Gln 100 105 110 CAA GCT GTC ATT GAG CCC ATA GTA ACT ACC AACTGG CAA AAG CTT GAG 383 Gln Ala Val Ile Glu Pro Ile Val Thr Thr Asn TrpGln Lys Leu Glu 115 120 125 GCC TTT TGG CAC AAG CAT 401 Ala Phe Trp HisLys His 130 (2) INFORMATION FOR SEQ ID NO: 40: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 133 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 40: Gln Asn Glu Ile Cys Leu Thr His Pro Ile Thr Lys Tyr IleMet Ala 1 5 10 15 Cys Met Ser Ala Asp Leu Glu Val Thr Thr Ser Thr TrpVal Leu Leu 20 25 30 Gly Gly Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu SerVal Gly Cys 35 40 45 Val Val Ile Val Gly His Ile Glu Leu Gly Gly Lys ProAla Ile Val 50 55 60 Pro Asp Lys Glu Val Leu Tyr Gln Gln Tyr Asp Glu MetGlu Glu Cys 65 70 75 80 Ser Gln Ala Ala Pro Tyr Ile Glu Gln Ala Gln ValIle Ala His Gln 85 90 95 Phe Lys Glu Lys Val Leu Gly Leu Leu Gln Arg AlaThr Gln Gln Gln 100 105 110 Ala Val Ile Glu Pro Ile Val Thr Thr Asn TrpGln Lys Leu Glu Ala 115 120 125 Phe Trp His Lys His 130 (2) INFORMATIONFOR SEQ ID NO: 41: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 509 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE:NO (vii) IMMEDIATE SOURCE: (B) CLONE: PC-2-1 (ix) FEATURE: (A) NAME/KEY:CDS (B) LOCATION: 3..509 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: CCATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA AGA AAC ACC 47 Met SerThr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr 1 5 10 15 AAC CGTCGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC GTT 95 Asn Arg ArgPro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val 20 25 30 GGC GGA GTTTAC TTG TTG CCG CGC AGG GGC CCT AGG ATG GGT GTG CGC 143 Gly Gly Val TyrLeu Leu Pro Arg Arg Gly Pro Arg Met Gly Val Arg 35 40 45 GCG ACT CGG AAGACT TCG GAA CGG TCG CAA CCC CGT GGA CGG CGT CAG 191 Ala Thr Arg Lys ThrSer Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln 50 55 60 CCT ATT CCC AAG GCGCGC CAG CCC ACG GGC CGG TCC TGG GGT CAA CCC 239 Pro Ile Pro Lys Ala ArgGln Pro Thr Gly Arg Ser Trp Gly Gln Pro 65 70 75 GGG TAC CCT TGG CCC CTTTAC GCC AAT GAG GGC CTC GGG TGG GCA GGG 287 Gly Tyr Pro Trp Pro Leu TyrAla Asn Glu Gly Leu Gly Trp Ala Gly 80 85 90 95 TGG CTG CTC TCC CCT CGAGGC TCT CGG CCT AAT TGG GGC CCC AAT GAC 335 Trp Leu Leu Ser Pro Arg GlySer Arg Pro Asn Trp Gly Pro Asn Asp 100 105 110 CCC CGG CGA AAA TCG CGTAAT TTG GGT AAG GTC ATC GAT ACC CTA ACG 383 Pro Arg Arg Lys Ser Arg AsnLeu Gly Lys Val Ile Asp Thr Leu Thr 115 120 125 TGC GGA TTC GCC GAT CTCATG GGG TAT ATC CCG CTC GTA GGC GGC CCC 431 Cys Gly Phe Ala Asp Leu MetGly Tyr Ile Pro Leu Val Gly Gly Pro 130 135 140 ATT GGG GGC GTC GCA AGGGCT CTC GCA CAC GGT GTG AGG GTC CTT GAG 479 Ile Gly Gly Val Ala Arg AlaLeu Ala His Gly Val Arg Val Leu Glu 145 150 155 GAC GGG GTA AAC TAT GCAACA GGG AAT TTA 509 Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu 160 165 (2)INFORMATION FOR SEQ ID NO: 42: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:169 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: Met Ser Thr AsnPro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 1 5 10 15 Arg Arg ProGln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30 Gly Val TyrLeu Leu Pro Arg Arg Gly Pro Arg Met Gly Val Arg Ala 35 40 45 Thr Arg LysThr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 50 55 60 Ile Pro LysAla Arg Gln Pro Thr Gly Arg Ser Trp Gly Gln Pro Gly 65 70 75 80 Tyr ProTrp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp Ala Gly Trp 85 90 95 Leu LeuSer Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp Pro 100 105 110 ArgArg Lys Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys 115 120 125Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Gly Pro Ile 130 135140 Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 145150 155 160 Gly Val Asn Tyr Ala Thr Gly Asn Leu 165 (2) INFORMATION FORSEQ ID NO:43: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 509 base pairs(B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear(ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO(vii) IMMEDIATE SOURCE: (B) CLONE: PC-2-6 (ix) FEATURE: (A) NAME/KEY:CDS (B) LOCATION: 3..509 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: CCATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA AGA AAC ACC 47 Met SerThr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr 1 5 10 15 AAC CGTCGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC GTT 95 Asn Arg ArgPro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val 20 25 30 GGC GGA GTTTAC TTG TTG CCG CGC AGG GGC CCT AGG ATG GGT GTG CGC 143 Gly Gly Val TyrLeu Leu Pro Arg Arg Gly Pro Arg Met Gly Val Arg 35 40 45 GCG ACT CGG AAGACT TCG GAA CGG TCG CAA CCC CGT GGA CGG CGT CAG 191 Ala Thr Arg Lys ThrSer Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln 50 55 60 CCT ATT CCC AAG GCGCGC CAG CCC ACG GGC CGG TCC TGG GGT CAA CCC 239 Pro Ile Pro Lys Ala ArgGln Pro Thr Gly Arg Ser Trp Gly Gln Pro 65 70 75 GGG TAC CCT TGG CCC CTTTAC GCC AAT GAG GGC CTC GGG TGG GCA GGG 287 Gly Tyr Pro Trp Pro Leu TyrAla Asn Glu Gly Leu Gly Trp Ala Gly 80 85 90 95 TGG CTG CTC TCC CCT CGAGGC TCT CGG CCT AAT TGG GGC CCC AAT GAC 335 Trp Leu Leu Ser Pro Arg GlySer Arg Pro Asn Trp Gly Pro Asn Asp 100 105 110 CCC CGG CGA AAA TCG CGTAAT TTG GGT AAG GTC ATC GAT ACC CTA ACG 383 Pro Arg Arg Lys Ser Arg AsnLeu Gly Lys Val Ile Asp Thr Leu Thr 115 120 125 TGC GGA TTC GCC GAT CTCATG GGG TAT ATC CCG CTC GTA GGC GGC CCC 431 Cys Gly Phe Ala Asp Leu MetGly Tyr Ile Pro Leu Val Gly Gly Pro 130 135 140 ATT GGG GGC GTC GCA AGGGCT CTC GCA CAC GGT GTG AGG GTC CTT GAG 479 Ile Gly Gly Val Ala Arg AlaLeu Ala His Gly Val Arg Val Leu Glu 145 150 155 GAC GGG GTA AAC TAT GCAACA GGG AAT TTA 509 Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu 160 165 (2)INFORMATION FOR SEQ ID NO: 44: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:169 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: Met Ser Thr AsnPro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 1 5 10 15 Arg Arg ProGln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30 Gly Val TyrLeu Leu Pro Arg Arg Gly Pro Arg Met Gly Val Arg Ala 35 40 45 Thr Arg LysThr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 50 55 60 Ile Pro LysAla Arg Gln Pro Thr Gly Arg Ser Trp Gly Gln Pro Gly 65 70 75 80 Tyr ProTrp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp Ala Gly Trp 85 90 95 Leu LeuSer Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp Pro 100 105 110 ArgArg Lys Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys 115 120 125Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Gly Pro Ile 130 135140 Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 145150 155 160 Gly Val Asn Tyr Ala Thr Gly Asn Leu 165 (2) INFORMATION FORSEQ ID NO: 45: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 580 base pairs(B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear(ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO(vii) IMMEDIATE SOURCE: (B) CLONE: PC-4-1 (ix) FEATURE: (A) NAME/KEY:CDS (B) LOCATION: 2..580 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: A ACGTGC GGA TTC GCC GAT CTC ATG GGG TAT ATC CCG CTC GTA GGC 46 Thr Cys GlyPhe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly 1 5 10 15 GGC CCC ATTGGG GGC GTC GCA AGG GCT CTC GCA CAC GGT GTG AGG GTC 94 Gly Pro Ile GlyGly Val Ala Arg Ala Leu Ala His Gly Val Arg Val 20 25 30 CTT GAG GAC GGGGTA AAC TAT GCA ACA GGG AAT TTA CCC GGT TGC TCT 142 Leu Glu Asp Gly ValAsn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser 35 40 45 TTC TCT ATC TTT ATTCTT GCT CTT CTC TCG TGT CTG ACC GTT CCG GCC 190 Phe Ser Ile Phe Ile LeuAla Leu Leu Ser Cys Leu Thr Val Pro Ala 50 55 60 TCT GCA GTT CCC TAC CGAAAT GCC TCT GGG ATT TAT CAT GTT ACC AAT 238 Ser Ala Val Pro Tyr Arg AsnAla Ser Gly Ile Tyr His Val Thr Asn 65 70 75 GAT TGC CCA AAC TCT TCC ATAGTC TAT GAG GCA GAT AAC CTG ATC CTA 286 Asp Cys Pro Asn Ser Ser Ile ValTyr Glu Ala Asp Asn Leu Ile Leu 80 85 90 95 CAC GCA CCT GGT TGC GTG CCTTGT GTC ATG ACA GGT AAT GTG AGT AGA 334 His Ala Pro Gly Cys Val Pro CysVal Met Thr Gly Asn Val Ser Arg 100 105 110 TGC TGG GTC CAA ATT ACC CCTACA CTG TCA GCC CCG AGC CTC GGA GCA 382 Cys Trp Val Gln Ile Thr Pro ThrLeu Ser Ala Pro Ser Leu Gly Ala 115 120 125 GTC ACG GCT CCT CTT CGG AGAGCC GTT GAC TAC CTA GCG GGA GGG GCT 430 Val Thr Ala Pro Leu Arg Arg AlaVal Asp Tyr Leu Ala Gly Gly Ala 130 135 140 GCC CTC TGC TCC GCG TTA TACGTA GGA GAC GCG TGT GGG GCA CTA TTC 478 Ala Leu Cys Ser Ala Leu Tyr ValGly Asp Ala Cys Gly Ala Leu Phe 145 150 155 TTG GTA GGC CAA ATG TTC ACCTAT AGG CCT CGC CAG CAC GCT ACG GTG 526 Leu Val Gly Gln Met Phe Thr TyrArg Pro Arg Gln His Ala Thr Val 160 165 170 175 CAG AAC TGC AAC TGT TCCATT TAC AGT GGC CAT GTT ACC GGC CAC CGG 574 Gln Asn Cys Asn Cys Ser IleTyr Ser Gly His Val Thr Gly His Arg 180 185 190 ATG GCA 580 Met Ala (2)INFORMATION FOR SEQ ID NO: 46: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: Thr Cys Gly PheAla Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Gly 1 5 10 15 Pro Ile GlyGly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 20 25 30 Glu Asp GlyVal Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 Ser Ile PheIle Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60 Ala Val ProTyr Arg Asn Ala Ser Gly Ile Tyr His Val Thr Asn Asp 65 70 75 80 Cys ProAsn Ser Ser Ile Val Tyr Glu Ala Asp Asn Leu Ile Leu His 85 90 95 Ala ProGly Cys Val Pro Cys Val Met Thr Gly Asn Val Ser Arg Cys 100 105 110 TrpVal Gln Ile Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala Val 115 120 125Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly Ala Ala 130 135140 Leu Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu 145150 155 160 Val Gly Gln Met Phe Thr Tyr Arg Pro Arg Gln His Ala Thr ValGln 165 170 175 Asn Cys Asn Cys Ser Ile Tyr Ser Gly His Val Thr Gly HisArg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO: 47: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 580 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:PC-4-6 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..580 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 47: A ACG TGC GGA TTC GCC GAT CTC ATGGGG TAT ATC CCG CTC GTA GGC 46 Thr Cys Gly Phe Ala Asp Leu Met Gly TyrIle Pro Leu Val Gly 1 5 10 15 GGC CCC ATT GGG GGC GTC GCA AGG GCT CTCGCA CAC GGT GTG AGG GTC 94 Gly Pro Ile Gly Gly Val Ala Arg Ala Leu AlaHis Gly Val Arg Val 20 25 30 CTT GAG GAC GGG GTA AAC TAT GCA ACA GGG AATTTA CCC GGT TGC TCT 142 Leu Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn LeuPro Gly Cys Ser 35 40 45 TTC TCT ATC TTT ATT CTT GCT CTT CTC TCG TGT CTGACC GTT CCG GCC 190 Phe Ser Ile Phe Ile Leu Ala Leu Leu Ser Cys Leu ThrVal Pro Ala 50 55 60 TCT GCA GTT CCC TAC CGA AAT GCC TCT GGG ATT TAT CATGTT ACC AAT 238 Ser Ala Val Pro Tyr Arg Asn Ala Ser Gly Ile Tyr His ValThr Asn 65 70 75 GAT TGC CCA AAC TCT TCC ATA GTC TAT GAG GCA GAT AAC CTGATC CTA 286 Asp Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Asp Asn Leu IleLeu 80 85 90 95 CAC GCA CCT GGT TGC GTG CCT TGT GTC ATG ACA GGT AAT GTGAGT AGA 334 His Ala Pro Gly Cys Val Pro Cys Val Met Thr Gly Asn Val SerArg 100 105 110 TGC TGG GTC CAA ATT ACC CCT ACA CTG TCA GCC CCG AGC CTCGGA GCA 382 Cys Trp Val Gln Ile Thr Pro Thr Leu Ser Ala Pro Ser Leu GlyAla 115 120 125 GTC ACG GCT CCT CTT CGG AGA GCC GTT GAC TAC CTA GCG GGAGGG GCT 430 Val Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly GlyAla 130 135 140 GCC CTC TGC TCC GCG TTA TAC GTA GGA GAC GCG TGT GGG GCACTA TTC 478 Ala Leu Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala LeuPhe 145 150 155 TTG GTA GGC CAA ATG TTC ACC TAT AGG CCT CGC CAG CAC GCTACG GTG 526 Leu Val Gly Gln Met Phe Thr Tyr Arg Pro Arg Gln His Ala ThrVal 160 165 170 175 CAG AAC TGC AAC TGT TCC ATT TAC AGT GGC CAT GTT ACCGGC CAC CGG 574 Gln Asn Cys Asn Cys Ser Ile Tyr Ser Gly His Val Thr GlyHis Arg 180 185 190 ATG GCA 580 Met Ala (2) INFORMATION FOR SEQ ID NO:48: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 193 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 48: Thr Cys Gly Phe Ala Asp Leu Met GlyTyr Ile Pro Leu Val Gly Gly 1 5 10 15 Pro Ile Gly Gly Val Ala Arg AlaLeu Ala His Gly Val Arg Val Leu 20 25 30 Glu Asp Gly Val Asn Tyr Ala ThrGly Asn Leu Pro Gly Cys Ser Phe 35 40 45 Ser Ile Phe Ile Leu Ala Leu LeuSer Cys Leu Thr Val Pro Ala Ser 50 55 60 Ala Val Pro Tyr Arg Asn Ala SerGly Ile Tyr His Val Thr Asn Asp 65 70 75 80 Cys Pro Asn Ser Ser Ile ValTyr Glu Ala Asp Asn Leu Ile Leu His 85 90 95 Ala Pro Gly Cys Val Pro CysVal Met Thr Gly Asn Val Ser Arg Cys 100 105 110 Trp Val Gln Ile Thr ProThr Leu Ser Ala Pro Ser Leu Gly Ala Val 115 120 125 Thr Ala Pro Leu ArgArg Ala Val Asp Tyr Leu Ala Gly Gly Ala Ala 130 135 140 Leu Cys Ser AlaLeu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu 145 150 155 160 Val GlyGln Met Phe Thr Tyr Arg Pro Arg Gln His Ala Thr Val Gln 165 170 175 AsnCys Asn Cys Ser Ile Tyr Ser Gly His Val Thr Gly His Arg Met 180 185 190Ala (2) INFORMATION FOR SEQ ID NO: 49: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 959 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO(iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE: PC-3-4 (ix)FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 3..959 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 49: CC ATG AGC ACG AAT CCT AAA CCT CAA AGA AAAACC AAA AGA AAC ACC 47 Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr LysArg Asn Thr 1 5 10 15 AAC CGT CGC CCA CAG GAC GTC AAG TTC CCG GGC GGTGGT CAG ATC GTT 95 Asn Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly GlyGln Ile Val 20 25 30 GGC GGA GTT TAC TTG TTG CCG CGC AGG GGC CCT AGG ATGGGT GTG CGC 143 Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Met GlyVal Arg 35 40 45 GCG ACT CGG AAG ACT TCG GAA CGG TCG CAA CCC CGT GGA CGGCGT CAG 191 Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg ArgGln 50 55 60 CCT ATT CCC AAG GCG CGC CAG CCC ACG GGC CGG TCC TGG GGT CAACCC 239 Pro Ile Pro Lys Ala Arg Gln Pro Thr Gly Arg Ser Trp Gly Gln Pro65 70 75 GGG TAC CCT TGG CCC CTT TAC GCC AAT GAG GGC CTC GGG TGG GCA GGG287 Gly Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp Ala Gly 8085 90 95 TGG CTG CTC TCC CCT CGA GGC TCT CGG CCT AAT TGG GGC CCC AAT GAC335 Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro Asn Asp 100105 110 CCC CGG CGA AAA TCG CGT AAT TTG GGT AAG GTC ATC GAT ACC CTA ACG383 Pro Arg Arg Lys Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr 115120 125 TGC GGA TTC GCC GAT CTC ATG GGG TAT ATC CCG CTC GTA GGC GGC CCC431 Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Gly Pro 130135 140 ATT GGG GGC GTC GCA AGG GCT CTC GCA CAC GGT GTG AGG GTC CTT GAG479 Ile Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu 145150 155 GAC GGG GTA AAC TAT GCA ACA GGG AAT TTA CCC GGT TGC TCT TTC TCT527 Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser 160165 170 175 ATC TTT ATT CTT GCT CTT CTC TCG TGT CTG ACC GTT CCG GCC TCTGCA 575 Ile Phe Ile Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala180 185 190 GTT CCC TAC CGA AAT GCC TCT GGG ATT TAT CAT GTT ACC AAT GATTGC 623 Val Pro Tyr Arg Asn Ala Ser Gly Ile Tyr His Val Thr Asn Asp Cys195 200 205 CCA AAC TCT TCC ATA GTC TAT GAG GCA GAT AAC CTG ATC CTA CACGCA 671 Pro Asn Ser Ser Ile Val Tyr Glu Ala Asp Asn Leu Ile Leu His Ala210 215 220 CCT GGT TGC GTG CCT TGT GTC ATG ACA GGT AAT GTG AGT AGA TGCTGG 719 Pro Gly Cys Val Pro Cys Val Met Thr Gly Asn Val Ser Arg Cys Trp225 230 235 GTC CAA ATT ACC CCT ACA CTG TCA GCC CCG AGC CTC GGA GCA GTCACG 767 Val Gln Ile Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala Val Thr240 245 250 255 GCT CCT CTT CGG AGA GCC GTT GAC TAC CTA GCG GGA GGG GCTGCC CTC 815 Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly Ala AlaLeu 260 265 270 TGC TCC GCG TTA TAC GTA GGA GAC GCG TGT GGG GCA CTA TTCTTG GTA 863 Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe LeuVal 275 280 285 GGC CAA ATG TTC ACC TAT AGG CCT CGC CAG CAC GCT ACG GTGCAG AAC 911 Gly Gln Met Phe Thr Tyr Arg Pro Arg Gln His Ala Thr Val GlnAsn 290 295 300 TGC AAC TGT TCC ATT TAC AGT GGC CAT GTT ACC GGC CAC CGGATG GCA 959 Cys Asn Cys Ser Ile Tyr Ser Gly His Val Thr Gly His Arg MetAla 305 310 315 (2) INFORMATION FOR SEQ ID NO: 50: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 319 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 50: Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg AsnThr Asn 1 5 10 15 Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly GlnIle Val Gly 20 25 30 Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Met GlyVal Arg Ala 35 40 45 Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly ArgArg Gln Pro 50 55 60 Ile Pro Lys Ala Arg Gln Pro Thr Gly Arg Ser Trp GlyGln Pro Gly 65 70 75 80 Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu GlyTrp Ala Gly Trp 85 90 95 Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp GlyPro Asn Asp Pro 100 105 110 Arg Arg Lys Ser Arg Asn Leu Gly Lys Val IleAsp Thr Leu Thr Cys 115 120 125 Gly Phe Ala Asp Leu Met Gly Tyr Ile ProLeu Val Gly Gly Pro Ile 130 135 140 Gly Gly Val Ala Arg Ala Leu Ala HisGly Val Arg Val Leu Glu Asp 145 150 155 160 Gly Val Asn Tyr Ala Thr GlyAsn Leu Pro Gly Cys Ser Phe Ser Ile 165 170 175 Phe Ile Leu Ala Leu LeuSer Cys Leu Thr Val Pro Ala Ser Ala Val 180 185 190 Pro Tyr Arg Asn AlaSer Gly Ile Tyr His Val Thr Asn Asp Cys Pro 195 200 205 Asn Ser Ser IleVal Tyr Glu Ala Asp Asn Leu Ile Leu His Ala Pro 210 215 220 Gly Cys ValPro Cys Val Met Thr Gly Asn Val Ser Arg Cys Trp Val 225 230 235 240 GlnIle Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala Val Thr Ala 245 250 255Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys 260 265270 Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu Val Gly 275280 285 Gln Met Phe Thr Tyr Arg Pro Arg Gln His Ala Thr Val Gln Asn Cys290 295 300 Asn Cys Ser Ile Tyr Ser Gly His Val Thr Gly His Arg Met Ala305 310 315 (2) INFORMATION FOR SEQ ID NO: 51: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 959 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:PC-3-8 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 3..959 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 51: CC ATG AGC ACG AAT CCT AAA CCT CAAAGA AAA ACC AAA AGA AAC ACC 47 Met Ser Thr Asn Pro Lys Pro Gln Arg LysThr Lys Arg Asn Thr 1 5 10 15 AAC CGT CGC CCA CAG GAC GTC AAG TTC CCGGGC GGT GGT CAG ATC GTT 95 Asn Arg Arg Pro Gln Asp Val Lys Phe Pro GlyGly Gly Gln Ile Val 20 25 30 GGC GGA GTT TAC TTG TTG CCG CGC AGG GGC CCTAGG ATG GGT GTG CGC 143 Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro ArgMet Gly Val Arg 35 40 45 GCG ACT CGG AAG ACT TCG GAA CGG TCG CAA CCC CGTGGA CGG CGT CAG 191 Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg GlyArg Arg Gln 50 55 60 CCT ATT CCC AAG GCG CGC CAG CCC ACG GGC CGG TCC TGGGGT CAA CCC 239 Pro Ile Pro Lys Ala Arg Gln Pro Thr Gly Arg Ser Trp GlyGln Pro 65 70 75 GGG TAC CCT TGG CCC CTT TAC GCC AAT GAG GGC CTC GGG TGGGCA GGG 287 Gly Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu Gly Trp AlaGly 80 85 90 95 TGG CTG CTC TCC CCT CGA GGC TCT CGG CCT AAT TGG GGC CCCAAT GAC 335 Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp Gly Pro AsnAsp 100 105 110 CCC CGG CGA AAA TCG CGT AAT TTG GGT AAG GTC ATC GAT ACCCTA ACG 383 Pro Arg Arg Lys Ser Arg Asn Leu Gly Lys Val Ile Asp Thr LeuThr 115 120 125 TGC GGA TTC GCC GAT CTC ATG GGG TAC ATC CCG CTC GTA GGCGGC CCC 431 Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly GlyPro 130 135 140 GTT GGG GGC GTC GCA AGG GCT CTC GCA CAC GGT GTG AGG GTCCTT GAG 479 Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val LeuGlu 145 150 155 GAC GGG GTA AAC TAT CCA ACA GGG AAT TTA CCC GGT TGC TCTTTC TCT 527 Asp Gly Val Asn Tyr Pro Thr Gly Asn Leu Pro Gly Cys Ser PheSer 160 165 170 175 ATC TTT ATT CTT GCT CTT CTC TCG TGT CTG ACC GTT CCGGCC TCT GCA 575 Ile Phe Ile Leu Ala Leu Leu Ser Cys Leu Thr Val Pro AlaSer Ala 180 185 190 GTT CCC TAC CGA AAT GCC TCT GGG ATT TAT CAT GTT ACCAAT GAT TGC 623 Val Pro Tyr Arg Asn Ala Ser Gly Ile Tyr His Val Thr AsnAsp Cys 195 200 205 CCA AAC TCT TCC ATA GTC TAT GAG GCA GAT AAC CTG ATCCTA CAC GCA 671 Pro Asn Ser Ser Ile Val Tyr Glu Ala Asp Asn Leu Ile LeuHis Ala 210 215 220 CCT GGT TGC GTG CCT TGT GTC ATG ACA GGT AAT GTG AGTAGA TGC TGG 719 Pro Gly Cys Val Pro Cys Val Met Thr Gly Asn Val Ser ArgCys Trp 225 230 235 GTC CAA ATT ACC CCT ACA CTG TCA GCC CCG AGC CTC GGAGCA GTC ACG 767 Val Gln Ile Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly AlaVal Thr 240 245 250 255 GCT CCT CTT CGG AGA GCC GTT GAC TAC CTA GCG GGAGGG GCT GCC CTC 815 Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly GlyAla Ala Leu 260 265 270 TGC TCC GCG TTA TAC GTA GGA GAC GCG TGT GGG GCACTA TTC TTG GTA 863 Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala LeuPhe Leu Val 275 280 285 GGC CAA ATG TTC ACC TAT AGG CCT CGC CAG CAC GCTACG GTG CAG AAC 911 Gly Gln Met Phe Thr Tyr Arg Pro Arg Gln His Ala ThrVal Gln Asn 290 295 300 TGC AAC TGT TCC ATT TAC AGT GGC CAT GTT ACC GGCCAC CGG ATG GCA 959 Cys Asn Cys Ser Ile Tyr Ser Gly His Val Thr Gly HisArg Met Ala 305 310 315 (2) INFORMATION FOR SEQ ID NO: 52: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 319 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 52: Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg AsnThr Asn 1 5 10 15 Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly GlnIle Val Gly 20 25 30 Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Met GlyVal Arg Ala 35 40 45 Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly ArgArg Gln Pro 50 55 60 Ile Pro Lys Ala Arg Gln Pro Thr Gly Arg Ser Trp GlyGln Pro Gly 65 70 75 80 Tyr Pro Trp Pro Leu Tyr Ala Asn Glu Gly Leu GlyTrp Ala Gly Trp 85 90 95 Leu Leu Ser Pro Arg Gly Ser Arg Pro Asn Trp GlyPro Asn Asp Pro 100 105 110 Arg Arg Lys Ser Arg Asn Leu Gly Lys Val IleAsp Thr Leu Thr Cys 115 120 125 Gly Phe Ala Asp Leu Met Gly Tyr Ile ProLeu Val Gly Gly Pro Val 130 135 140 Gly Gly Val Ala Arg Ala Leu Ala HisGly Val Arg Val Leu Glu Asp 145 150 155 160 Gly Val Asn Tyr Pro Thr GlyAsn Leu Pro Gly Cys Ser Phe Ser Ile 165 170 175 Phe Ile Leu Ala Leu LeuSer Cys Leu Thr Val Pro Ala Ser Ala Val 180 185 190 Pro Tyr Arg Asn AlaSer Gly Ile Tyr His Val Thr Asn Asp Cys Pro 195 200 205 Asn Ser Ser IleVal Tyr Glu Ala Asp Asn Leu Ile Leu His Ala Pro 210 215 220 Gly Cys ValPro Cys Val Met Thr Gly Asn Val Ser Arg Cys Trp Val 225 230 235 240 GlnIle Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala Val Thr Ala 245 250 255Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys 260 265270 Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu Val Gly 275280 285 Gln Met Phe Thr Tyr Arg Pro Arg Gln His Ala Thr Val Gln Asn Cys290 295 300 Asn Cys Ser Ile Tyr Ser Gly His Val Thr Gly His Arg Met Ala305 310 315 (2) INFORMATION FOR SEQ ID NO: 53: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 959 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:PC C/E1 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..959 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 53: CCATGAGCAC GAATCCTAAA CCTCAAAGAAAAACCAAAAG AAACACCAAC CGTCGCCCAC 60 AGGACGTCAA GTTCCCGGGC GGTGGTCAGATCGTTGGCGG AGTTTACTTG TTGCCGCGCA 120 GGGGCCCTAG GATGGGTGTG CGCGCGACTCGGAAGACTTC GGAACGGTCG CAACCCCGTG 180 GACGGCGTCA GCCTATTCCC AAGGCGCGCCAGCCCACGGG CCGGTCCTGG GGTCAACCCG 240 GGTACCCTTG GCCCCTTTAC GCCAATGAGGGCCTCGGGTG GGCAGGGTGG CTGCTCTCCC 300 CTCGAGGCTC TCGGCCTAAT TGGGGCCCCAATGACCCCCG GCGAAAATCG CGTAATTTGG 360 GTAAGGTCAT CGATACCCTA ACGTGCGGATTCGCCGATCT CATGGGGTAY ATCCCGCTCG 420 TAGGCGGCCC CRTTGGGGGC GTCGCAAGGGCTCTCGCACA CGGTGTGAGG GTCCTTGAGG 480 ACGGGGTAAA CTATSCAACA GGGAATTTACCCGGTTGCTC TTTCTCTATC TTTATTCTTG 540 CTCTTCTCTC GTGTCTGACC GTTCCGGCCTCTGCAGTTCC CTACCGAAAT GCCTCTGGGA 600 TTTATCATGT TACCAATGAT TGCCCAAACTCTTCCATAGT CTATGAGGCA GATAACCTGA 660 TCCTACACGC ACCTGGTTGC GTGCCTTGTGTCATGACAGG TAATGTGAGT AGATGCTGGG 720 TCCAAATTAC CCCTACACTG TCAGCCCCGAGCCTCGGAGC AGTCACGGCT CCTCTTCGGA 780 GAGCCGTTGA CTACCTAGCG GGAGGGGCTGCCCTCTGCTC CGCGTTATAC GTAGGAGACG 840 CGTGTGGGGC ACTATTCTTG GTAGGCCAAATGTTCACCTA TAGGCCTCGC CAGCACGCTA 900 CGGTGCAGAA CTGCAACTGT TCCATTTACAGTGGCCATGT TACCGGCCAC CGGATGGCA 959 (2) INFORMATION FOR SEQ ID NO: 54:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 319 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 54: Met Ser Thr Asn Pro Lys Pro Gln ArgLys Thr Lys Arg Asn Thr Asn 1 5 10 15 Arg Arg Pro Gln Asp Val Lys PhePro Gly Gly Gly Gln Ile Val Gly 20 25 30 Gly Val Tyr Leu Leu Pro Arg ArgGly Pro Arg Met Gly Val Arg Ala 35 40 45 Thr Arg Lys Thr Ser Glu Arg SerGln Pro Arg Gly Arg Arg Gln Pro 50 55 60 Ile Pro Lys Ala Arg Gln Pro ThrGly Arg Ser Trp Gly Gln Pro Gly 65 70 75 80 Tyr Pro Trp Pro Leu Tyr AlaAsn Glu Gly Leu Gly Trp Ala Gly Trp 85 90 95 Leu Leu Ser Pro Arg Gly SerArg Pro Asn Trp Gly Pro Asn Asp Pro 100 105 110 Arg Arg Lys Ser Arg AsnLeu Gly Lys Val Ile Asp Thr Leu Thr Cys 115 120 125 Gly Phe Ala Asp LeuMet Gly Tyr Ile Pro Leu Val Gly Gly Pro Val 130 135 140 Gly Gly Val AlaArg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 145 150 155 160 Gly ValAsn Tyr Pro Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile 165 170 175 PheIle Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Val 180 185 190Pro Tyr Arg Asn Ala Ser Gly Ile Tyr His Val Thr Asn Asp Cys Pro 195 200205 Asn Ser Ser Ile Val Tyr Glu Ala Asp Asn Leu Ile Leu His Ala Pro 210215 220 Gly Cys Val Pro Cys Val Met Thr Gly Asn Val Ser Arg Cys Trp Val225 230 235 240 Gln Ile Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala ValThr Ala 245 250 255 Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly AlaAla Leu Cys 260 265 270 Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala LeuPhe Leu Val Gly 275 280 285 Gln Met Phe Thr Tyr Arg Pro Arg Gln His AlaThr Val Gln Asn Cys 290 295 300 Asn Cys Ser Ile Tyr Ser Gly His Val ThrGly His Arg Met Ala 305 310 315 (2) INFORMATION FOR SEQ ID NO: 55: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 354 base pairs (B) TYPE: nucleicacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE:(B) CLONE: PC-1-37 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..354(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: ACCACCGGAG CTTCTATCACATACTCCACT TACGGCAAGT TCCTTGCTGA TGGAGGGTGT 60 TCAGGCGGCG CGCATGACGTGATCATATGC GACGAGTGCC ATTCCCAGGA CGCCACCACC 120 ATTCTTGGGA TAGGCACTGTCCTTGACCAG GCAGAGACGG CTGGAGCTAG GCTCGTCGTC 180 TTGGCCACGG NCACCCCTCCCGGCAGTGTG ACAACGCCCC ACCCCAACAT CGAGGAAGTG 240 GCCCTGCCTC AGGAGGGGGAGGTTCCCTTC TACGGCAGAG CCATTCCCCT TGCTTTTATA 300 AAGGGTGGTA GGCATCTCATCTTCTGCCAT TCCAAGAAAA ATTGTGATGA ACTC 354 (2) INFORMATION FOR SEQ ID NO:56: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 118 amino acids (B) TYPE:amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: Thr Thr Gly AlaSer Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala 1 5 10 15 Asp Gly GlyCys Ser Gly Gly Ala His Asp Val Ile Ile Cys Asp Glu 20 25 30 Cys His SerGln Asp Ala Thr Thr Ile Leu Gly Ile Gly Thr Val Leu 35 40 45 Asp Gln AlaGlu Thr Ala Gly Ala Arg Leu Val Val Leu Ala Thr Xaa 50 55 60 Thr Pro ProGly Ser Val Thr Thr Pro His Pro Asn Ile Glu Glu Val 65 70 75 80 Ala LeuPro Gln Glu Gly Glu Val Pro Phe Tyr Gly Arg Ala Ile Pro 85 90 95 Leu AlaPhe Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys 100 105 110 LysAsn Cys Asp Glu Leu 115 (2) INFORMATION FOR SEQ ID NO: 57: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 354 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:PC-1-48 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..354 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 57: ACCACCGGAG CTTCTATCAC ATACTCCACTTACGGCAAGT TCCTTGCTGA TGGAGGGTGT 60 TCAGGCGGCG CGTATGACGT GATCATATGCGACGAGTGCC ATTCCCAGGA CGCCACCACC 120 ATTCTTGGGA TAGGCACTGT CCTTGACCAGGCAGAGACGG CTGGAGCTAG GCTCGTCGTC 180 TTGGNCACGG NCACCCCTCC CGGCAGTGTGACAACGCCCC ACCCCAACAT CGAGGAAGTG 240 GCCCTGCCTC AGGAGGGGGA GGTTCCCTTCTACGGNAGAG CCATTCCCCT TGCTTTTATA 300 AAGGGTGGTA GGCATCTCAT CTTCTGCCATTCCAAGAAAA AATGTGATGA ACTT 354 (2) INFORMATION FOR SEQ ID NO: 58: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 133 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: Thr Thr Gly Ala SerIle Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala 1 5 10 15 Asp Gly Gly CysSer Gly Gly Ala Tyr Asp Val Ile Ile Cys Asp Glu 20 25 30 Cys His Ser GlnAsp Ala Thr Thr Ile Leu Gly Ile Gly Thr Val Leu 35 40 45 Asp Gln Ala GluThr Ala Gly Ala Arg Leu Val Val Leu Xaa Thr Xaa 50 55 60 Thr Pro Pro GlySer Val Thr Thr Pro His Pro Asn Ile Glu Glu Val 65 70 75 80 Ala Leu ProGln Glu Gly Glu Val Pro Phe Tyr Xaa Arg Ala Ile Pro 85 90 95 Leu Ala PheIle Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys 100 105 110 Lys LysCys Asp Glu Leu Arg Gln Ala Thr Asp Gln Pro Gly Arg Glu 115 120 125 ArgPro Trp Glu Tyr 130 (2) INFORMATION FOR SEQ ID NO: 59: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 357 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:PC-1-37 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..357 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 59: ATGGCTTTCA TGTCTCCGGA CTTGGAGGTCATTACCANCA CTTGGGTTCT GGTGGGGGGC 60 GTTGTGGCGA CCCTGNCGNC CTACTGCTTGACGGTGGGTT CGGTAGCCAT AGTCGGTAGG 120 ATCATCCTCT CTGGGAAACC TGCCATCATTNCCGATAGGG AGGTATTATA CCAGCAATTT 180 GATGAGATGG AGGAGTGCTC GGCCTCGTTGCCCTATATGG ACGAAACACG TNCCATTGCC 240 GGACAATTCA AAGAGAAAGT GCTCGGCTTCATCAGCACGA CCGGCCAGAA GGCTGAAACT 300 CTGAAGCCGG CAGCCACGTC TGTGTGGAACAAGGCTGATC AGTTCTGGNC CACATAC 357 (2) INFORMATION FOR SEQ ID NO: 60: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 128 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: Met Ala Phe Met SerPro Asp Leu Glu Val Ile Thr Xaa Thr Trp Val 1 5 10 15 Leu Val Gly GlyVal Val Ala Thr Leu Xaa Xaa Tyr Cys Leu Thr Val 20 25 30 Gly Ser Val AlaIle Val Gly Arg Ile Ile Leu Ser Gly Lys Pro Ala 35 40 45 Ile Ile Xaa AspArg Glu Val Leu Tyr Gln Gln Phe Asp Glu Met Glu 50 55 60 Glu Cys Ser AlaSer Leu Pro Tyr Met Asp Glu Thr Arg Xaa Ile Ala 65 70 75 80 Gly Gln PheLys Glu Lys Val Leu Gly Phe Ile Ser Thr Thr Gly Gln 85 90 95 Lys Ala GluThr Leu Lys Pro Ala Ala Thr Ser Val Trp Asn Lys Ala 100 105 110 Asp GlnPhe Trp Xaa Thr Tyr Met Trp Asn Phe Ile Ser Gly Ile Gln 115 120 125 (2)INFORMATION FOR SEQ ID NO: 61: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:357 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii)ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE: PC-1-48 (ix) FEATURE:(A) NAME/KEY: CDS (B) LOCATION: 1..357 (xi) SEQUENCE DESCRIPTION: SEQ IDNO: 61: ATGGCTTGCA TGTCTGCGGA CCTGGAGGTC ATTACCANCA CTTGGGTTCTGGTGGGGGGC 60 GTTGTGGCGN CCCTGGCGGC CTACTGCTTG ACGGTGGGTT CGGTAGCCATAGTCGGTAGG 120 ATCATCCTCT CTGGGAAACC TGCCATCATT CCCGATAGGG AGGCATTATACCANCAATTT 180 GATGAGATGG AGGAGTGCTC GGCCTCGTTG CCCTATATGG ACGAGACACGTGCCATTGCC 240 GGACAATTCA AAGAGAAAGT GCTCGGCTTC ATCAGCACGA CCGGCCAGAAGGCTGAAACT 300 CTGAAGCCGG CAGCCACGTC TGTGTGGAAC AAGGCTGANC AGTTCTGGGCCACATAC 357 (2) INFORMATION FOR SEQ ID NO: 62: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 128 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: Met Ala Cys Met Ser Ala AspLeu Glu Val Ile Thr Xaa Thr Trp Val 1 5 10 15 Leu Val Gly Gly Val ValAla Xaa Leu Ala Ala Tyr Cys Leu Thr Val 20 25 30 Gly Ser Val Ala Ile ValGly Arg Ile Ile Leu Ser Gly Lys Pro Ala 35 40 45 Ile Ile Pro Asp Arg GluAla Leu Tyr Xaa Gln Phe Asp Glu Met Glu 50 55 60 Glu Cys Ser Ala Ser LeuPro Tyr Met Asp Glu Thr Arg Ala Ile Ala 65 70 75 80 Gly Gln Phe Lys GluLys Val Leu Gly Phe Ile Ser Thr Thr Gly Gln 85 90 95 Lys Ala Glu Thr LeuLys Pro Ala Ala Thr Ser Val Trp Asn Lys Ala 100 105 110 Xaa Gln Phe TrpAla Thr Tyr Met Trp Asn Phe Ile Ser Gly Ile Gln 115 120 125 (2)INFORMATION FOR SEQ ID NO: 63: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:28 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:YES (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: misc_feature (B)LOCATION: 1..28 (D) OTHER INFORMATION: /standard_name= “HCV PrimerHCPr161” (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: ACCGGAGGCC AGGAGAGTGATCTCCTCC 28 (2) INFORMATION FOR SEQ ID NO: 64: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 28 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA(genomic) (iii) HYPOTHETICAL: YES (iii) ANTI-SENSE: YES (ix) FEATURE:(A) NAME/KEY: misc_feature (B) LOCATION: 1..28 (D) OTHER INFORMATION:/standard_name= “HCV Primer HCPr162” (xi) SEQUENCE DESCRIPTION: SEQ IDNO: 64: GGGCTGCTCT ATCCTCATCG ACGCCATC 28 (2) INFORMATION FOR SEQ ID NO:65: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 28 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: DNA (genomic) (iii) HYPOTHETICAL: YES (iii) ANTI-SENSE: NO (ix)FEATURE: (A) NAME/KEY: misc_feature (B) LOCATION: 1..28 (D) OTHERINFORMATION: /standard_name= “HCV Primer HCPr163” (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 65: GCCAGAGGCT CGGAAGGCGA TCAGCGCT 28 (2)INFORMATION FOR SEQ ID NO: 66: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:28 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:YES (iii) ANTI-SENSE: YES (ix) FEATURE: (A) NAME/KEY: misc_feature (B)LOCATION: 1..28 (D) OTHER INFORMATION: /standard_name= “HCV PrimerHCPr164” (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: GAGCTGCTCT GTCCTCCTCGACGCCGCA 28 (2) INFORMATION FOR SEQ ID NO: 67: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA(genomic) (iii) HYPOTHETICAL: YES (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: misc_feature (B) LOCATION: 1..28 (D) OTHER INFORMATION:/standard_name= “HCV Primer HCPr23” (xi) SEQUENCE DESCRIPTION: SEQ IDNO: 67: CTCATGGGGT ACATTCCGCT 20 (2) INFORMATION FOR SEQ ID NO: 68: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleicacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:DNA (genomic) (iii) HYPOTHETICAL: YES (iii) ANTI-SENSE: YES (ix)FEATURE: (A) NAME/KEY: misc_feature (B) LOCATION: 1..28 (D) OTHERINFORMATION: /standard_name= “HCV Primer HCPr54” (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 68: CTATTACCAG TTCATCATCA TATCCCA 27 (2)INFORMATION FOR SEQ ID NO: 69: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:YES (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: misc_feature (B)LOCATION: 1..28 (D) OTHER INFORMATION: /standard_name= “HCV PrimerHCPr116” (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: TTTTAAATAC ATCATGRCTGYATG 24 (2) INFORMATION FOR SEQ ID NO: 70: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 33 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS:single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii)HYPOTHETICAL: YES (iii) ANTI-SENSE: YES (ix) FEATURE: (A) NAME/KEY:misc_feature (B) LOCATION: 1..28 (D) OTHER INFORMATION: /standard_name=“HCV Primer HCPr66” (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: CTATTATTGTATCCCRCTGA TGAARTTCCA CAT 33 (2) INFORMATION FOR SEQ ID NO: 71: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 36 base pairs (B) TYPE: nucleicacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:DNA (genomic) (iii) HYPOTHETICAL: YES (iii) ANTI-SENSE: YES (ix)FEATURE: (A) NAME/KEY: misc_feature (B) LOCATION: 1..28 (D) OTHERINFORMATION: /standard_name= “HCV Primer HCPr118: (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 71: ACTAGTCGAC TAYTGATCCR CTATRWARTT CCACAT 36(2) INFORMATION FOR SEQ ID NO: 72: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii)HYPOTHETICAL: YES (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY:misc_feature (B) LOCATION: 1..28 (D) OTHER INFORMATION: /standard_name=”HCV Primer HCPr117: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:TTTTAAATAC ATCGCRCTGC ATGCA 25 (2) INFORMATION FOR SEQ ID NO: 73: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 36 base pairs (B) TYPE: nucleicacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:DNA (genomic) (iii) HYPOTHETICAL: YES (iii) ANTI-SENSE: YES (ix)FEATURE: (A) NAME/KEY: misc_feature (B) LOCATION: 1..28 (D) OTHERINFORMATION: /standard_name= “HCV Primer HCPr119: (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 73: ACTAGTCGAC TARTTGCATA GCCKRTTCAT CCAYTG 36(2) INFORMATION FOR SEQ ID NO: 74: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 34 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii)HYPOTHETICAL: YES (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY:misc_feature (B) LOCATION: 1..28 (D) OTHER INFORMATION: /standard_name=”HCV Primer HCPr131: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:GGAATTCTAG ACCTCTGGGA YGARAYTGGA ARTG 34 (2) INFORMATION FOR SEQ ID NO:75: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 31 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: DNA (genomic) (iii) HYPOTHETICAL: YES (iii) ANTI-SENSE: NO (ix)FEATURE: (A) NAME/KEY: misc_feature (B) LOCATION: 1..28 (D) OTHERINFORMATION: /standard_name= “HCV Primer HCPr130: (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 75: GGAATTCTAG ACGCTAYCAR GCACGTTGYG C 31 (2)INFORMATION FOR SEQ ID NO: 76: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:YES (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: misc_feature (B)LOCATION: 1..28 (D) OTHER INFORMATION: /standard_name= ”HCV PrimerHCPr134: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: CATATAGATG CCCACTTCCTATC 23 (2) INFORMATION FOR SEQ ID NO: 77: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 16 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS:single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii)HYPOTHETICAL: YES (iii) ANTI-SENSE: YES (ix) FEATURE: (A) NAME/KEY:misc_feature (B) LOCATION: 1..28 (D) OTHER INFORMATION: /standard_name=“HCV Primer HCPr3: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: GTGTGCCAGGACCATC 16 (2) INFORMATION FOR SEQ ID NO: 78: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA(genomic) (iii) HYPOTHETICAL: YES (iii) ANTI-SENSE: YES (ix) FEATURE:(A) NAME/KEY: misc_feature (B) LOCATION: 1..28 (D) OTHER INFORMATION:/standard_name= ”HCV Primer HCPr4: (xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: GACATGCATG TCATGATGTA 20 (2) INFORMATION FOR SEQ ID NO: 79: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 base pairs (B) TYPE: nucleicacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:DNA (genomic) (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE:(A) NAME/KEY: misc_feature (B) LOCATION: 1..28 (D) OTHER INFORMATION:/standard_name= “HCV Primer HCPr152: (xi) SEQUENCE DESCRIPTION: SEQ IDNO: 79: TACGCCTCTT CTATATCGGT TGGGGCCTG 29 (2) INFORMATION FOR SEQ IDNO: 80: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base pairs (B)TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: YES (iii) ANTI-SENSE:NO (ix) FEATURE: (A) NAME/KEY: misc_feature (B) LOCATION: 1..28 (D)OTHER INFORMATION: /standard_name= ”HCV Primer HCPr52: (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 80: ATGTTGGGTA AGGTCATCGA TACCCT 26 (2)INFORMATION FOR SEQ ID NO: 81: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:YES (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: misc_feature (B)LOCATION: 1..28 (D) OTHER INFORMATION: /standard_name= “HCV PrimerHCPr41: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: CCCGGGAGGT CTCGTAGACCGTGCA 25 (2) INFORMATION FOR SEQ ID NO: 82: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA(genomic) (iii) HYPOTHETICAL: YES (iii) ANTI-SENSE: YES (ix) FEATURE:(A) NAME/KEY: misc_feature (B) LOCATION: 1..28 (D) OTHER INFORMATION:/standard_name= ”HCV Primer HCPr40: (xi) SEQUENCE DESCRIPTION: SEQ IDNO: 82: CTATTAAAGA TAGAGAAAGA GCAACCGGG 29 (2) INFORMATION FOR SEQ IDNO: 83: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B)TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME:(B) MAP POSITION: positions 192 to 203 of the V1 region of HCV type 3(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: Leu Glu Trp Arg Asn Thr SerGly Leu Tyr Val Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 84: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME: (B) MAPPOSITION: positions 192 to 203 of the V1 region of HCV type 5 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 84: Val Pro Tyr Arg Asn Ala Ser Gly IleTyr His Val 1 5 10 (2) INFORMATION FOR SEQ ID NO: 85: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME: (B) MAP POSITION:positions 213 to 223 of the V2 region ofHCV type 3 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 85: Val Tyr Glu Ala Asp Asp Val Ile Leu His Thr1 5 10 (2) INFORMATION FOR SEQ ID NO: 86: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 11 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO(viii) POSITION IN GENOME: (B) MAP POSITION: positions 213 to 233 of theV2 region of HCV type 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: ValTyr Glu Ala Asp Asn Leu Ile Leu His Ala 1 5 10 (2) INFORMATION FOR SEQID NO: 87: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 amino acids (B)TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME:(B) MAP POSITION: positions 230 to 242 of the V3 region of HCV type 3(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: Val Gln Asp Gly Asn Thr SerThr Cys Trp Thr Pro Val 1 5 10 (2) INFORMATION FOR SEQ ID NO: 88: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME: (B) MAPPOSITION: positions 230 to 242 of the V3 region of HCV type 5 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 88: Val Met Thr Gly Asn Val Ser Arg CysTrp Val Gln Ile 1 5 10 (2) INFORMATION FOR SEQ ID NO: 89: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 10 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME: B) MAP POSITION:positions 248 to 257 of the V4 region of HCV type 3 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 89: Val Arg Tyr Val Gly Ala Thr Thr Ala Ser 1 510 (2) INFORMATION FOR SEQ ID NO: 90: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO(viii) POSITION IN GENOME: (B) MAP POSITION: positions 248 to 257 of theV4 region of HCV type 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: AlaPro Ser Leu Gly Ala Val Thr Ala Pro 1 5 10 (2) INFORMATION FOR SEQ IDNO: 91: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 10 amino acids (B)TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME:(B) MAP POSITION: positions 294 to 303 of the V5 region of HCV type 3(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: Arg Pro Arg Arg His Gln ThrVal Gln Thr 1 5 10 (2) INFORMATION FOR SEQ ID NO: 92: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 10 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME: (B) MAP POSITION:positions 294 to 303 of the V5 region of HCV type 5 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 92: Arg Pro Arg Gln His Ala Thr Val Gln Asn 1 510 (2) INFORMATION FOR SEQ ID NO: 93: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 9 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO(viii) POSITION IN GENOME: (B) MAP POSITION: positions 70 to 78 of HCVtype 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: Gln Pro Thr Gly Arg SerTrp Gly Gln 1 5 (2) INFORMATION FOR SEQ ID NO: 94: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 8 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE:BR33 and BR36 (viii) POSITION IN GENOME: (B) MAP POSITION: positions 230to 237 of the V3 region of HCV type 3 (xi) SEQUENCE DESCRIPTION: SEQ IDNO: 94: Val Gln Asp Gly Asn Thr Ser Thr 1 5 (2) INFORMATION FOR SEQ IDNO: 95: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 8 amino acids (B)TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (C)INDIVIDUAL ISOLATE: HD10 (viii) POSITION IN GENOME: (B) MAP POSITION:positions 230 to 237 of the V3 region of HCV type 3 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 95: Val Gln Asp Gly Asn Thr Ser Ala 1 5 (2)INFORMATION FOR SEQ ID NO: 96: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH:10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (vi)ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: BR36 (viii) POSITION IN GENOME:(B) MAP POSITION: positions 248 to 257 of the V4 region of HCV type 3(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: Val Lys Tyr Val Gly Ala ThrThr Ala Ser 1 5 10 (2) INFORMATION FOR SEQ ID NO: 97: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 20 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE:BR36 (viii) POSITION IN GENOME: (B) MAP POSITION: Positions 1688 to 1707of HCV type 3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: Leu Gly Gly LysPro Ala Ile Val Pro Asp Lys Glu Val Leu Tyr Gln 1 5 10 15 Gln Tyr AspGlu 20 (2) INFORMATION FOR SEQ ID NO: 98: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 20 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO(vi) ORIGINAL SOURCE: (C) INDIVIDUAL ISOLATE: HD10 (viii) POSITION INGENOME: (B) MAP POSITION: positions 1688 to 1707 of HCV type 3 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 98: Leu Gly Gly Lys Pro Ala Leu Val ProAsp Lys Glu Val Leu Tyr Gln 1 5 10 15 Gln Tyr Asp Glu 20 (2) INFORMATIONFOR SEQ ID NO: 99: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 aminoacids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear(ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (viii) POSITION INGENOME: (B) MAP POSITION: positions 1712 to 1731 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 99: Ser Gln Ala Ala Pro Tyr Ile Glu Gln Ala GlnVal Ile Ala His Gln 1 5 10 15 Phe Lys Glu Lys 20 (2) INFORMATION FOR SEQID NO: 100: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 amino acids (B)TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (C)INDIVIDUAL ISOLATE: BR36 (viii) POSITION IN GENOME: (B) MAP POSITION:positions 1724 to 1743 of HCV type 3 (xi) SEQUENCE DESCRIPTION: SEQ IDNO: 100: Ile Ala His Gln Phe Lys Glu Lys Val Leu Gly Leu Leu Gln Arg Ala1 5 10 15 Thr Gln Gln Gln 20 (2) INFORMATION FOR SEQ ID NO: 101: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (C) INDIVIDUALISOLATE: HD10 (viii) POSITION IN GENOME: (B) MAP POSITION: positions1724 to 1743 of HCV type 3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:Ile Ala His Gln Phe Lys Glu Lys Ile Leu Gly Leu Leu Gln Arg Ala 1 5 1015 Thr Gln Gln Gln 20 (2) INFORMATION FOR SEQ ID NO: 102: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 20 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME: (B) MAP POSITION:positions 1688 to 1707 of HCV type 5 (xi) SEQUENCE DESCRIPTION: SEQ IDNO: 102: Leu Ser Gly Lys Pro Ala Ile Ile Pro Asp Arg Glu Ala Leu Tyr Gln1 5 10 15 Gln Phe Asp Glu 20 (2) INFORMATION FOR SEQ ID NO: 103: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME: (B) MAPPOSITION: positions 1688 to 1707 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: Leu Ser Gly Lys Pro Ala Ile Ile Pro Asp Arg Glu Val Leu Tyr Gln 1 510 15 Gln Phe Asp Glu 20 (2) INFORMATION FOR SEQ ID NO: 104: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME: (B) MAPPOSITION: position 1712 to 1731 of HCV type 5 (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 104: Ser Ala Ser Leu Pro Tyr Met Asp Glu Thr Arg Ala Ile AlaGly Gln 1 5 10 15 Phe Lys Glu Lys 20 (2) INFORMATION FOR SEQ ID NO: 105:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (iii) HYPOTHETICAL: NO (viii) POSITION IN GENOME: (B) MAPPOSITION: positions 1724 to 1743 of HCV type 5 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 105: Ile Ala Gly Gln Phe Lys Glu Lys Val Leu GlyPhe Ile Ser Thr Thr 1 5 10 15 Gly Gln Lys Ala 20 (2) INFORMATION FOR SEQID NO: 106: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 340 base pairs (B)TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii)IMMEDIATE SOURCE: (B) CLONE: GB48-3-10 (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 2..340 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: C TCCACT GTA ACC GAA AAG GAC ATC AGG GTC GAG GAG GAG GTC TAT 46 Ser Thr ValThr Glu Lys Asp Ile Arg Val Glu Glu Glu Val Tyr 1 5 10 15 CAG TGT TGTGAC CTG GAG CCC GAA GCC CGC AAG GCA ATT ACC GCC CTA 94 Gln Cys Cys AspLeu Glu Pro Glu Ala Arg Lys Ala Ile Thr Ala Leu 20 25 30 ACA GAG AGA CTCTAC GTG GGC GGT CCC ATG CAT AAC AGC AAG GGA GAC 142 Thr Glu Arg Leu TyrVal Gly Gly Pro Met His Asn Ser Lys Gly Asp 35 40 45 CTG TGC GGG TAT CGCAGA TGT CGC GCA AGC GGC GTC TAC ACC ACC AGC 190 Leu Cys Gly Tyr Arg ArgCys Arg Ala Ser Gly Val Tyr Thr Thr Ser 50 55 60 TTC GGG AAC ACA CTG ACGTGC TAC CTC AAA GCC TCA GCC GCT ATC AAA 238 Phe Gly Asn Thr Leu Thr CysTyr Leu Lys Ala Ser Ala Ala Ile Lys 65 70 75 GCG GCG GGG CTG AGA GAC TGCACC ATG TTG GTC TGT GGT GAT GAC CTG 286 Ala Ala Gly Leu Arg Asp Cys ThrMet Leu Val Cys Gly Asp Asp Leu 80 85 90 95 GTT GTC ATC GCT GAG AGC GATGGC GTA GAG GAG GAC AAA CGA CCC CTC 334 Val Val Ile Ala Glu Ser Asp GlyVal Glu Glu Asp Lys Arg Pro Leu 100 105 110 GGA GCC 340 Gly Ala (2)INFORMATION FOR SEQ ID NO: 107: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: SerThr Val Thr Glu Lys Asp Ile Arg Val Glu Glu Glu Val Tyr Gln 1 5 10 15Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Ala Ile Thr Ala Leu Thr 20 25 30Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Lys Gly Asp Leu 35 40 45Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 50 55 60Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Ile Lys Ala 65 70 7580 Ala Gly Leu Arg Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 85 9095 Val Ile Ala Glu Ser Asp Gly Val Glu Glu Asp Lys Arg Pro Leu Gly 100105 110 Ala (2) INFORMATION FOR SEQ ID NO: 108: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:GB116-3-5 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..340 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 108: C TCC ACT GTA ACC GAA AAG GAC ATCAGG GTC GAG GAG GAG GTA TAT 46 Ser Thr Val Thr Glu Lys Asp Ile Arg ValGlu Glu Glu Val Tyr 1 5 10 15 CAG TGT TGT GAC CTG GAG CCC GAG GCC CGCAGA GCA ATT ACC GCC CTA 94 Gln Cys Cys Asp Leu Glu Pro Glu Ala Arg ArgAla Ile Thr Ala Leu 20 25 30 ACA GAG AGA CTC TAC GTG GGC GGT CCC ATG CATAAC AGC AGG GGA GAC 142 Thr Glu Arg Leu Tyr Val Gly Gly Pro Met His AsnSer Arg Gly Asp 35 40 45 CTG TGC GGG TAT CGC AGA TGC CGT GCG AGC GGC GTCTAC ACC ACC AGC 190 Leu Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val TyrThr Thr Ser 50 55 60 TTC GGG AAC ACA CTG ACG TGC TAT CTC AAA GCC TCA GCCGCT ATC AGA 238 Phe Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala AlaIle Arg 65 70 75 GCG GCG GGG CTG AGA GAC TGC ACC ATG TTG GTC TGT GGT GATGAC CTG 286 Ala Ala Gly Leu Arg Asp Cys Thr Met Leu Val Cys Gly Asp AspLeu 80 85 90 95 GTC GTC ATT GCT GAA AGC GAT GGC GTA GAG GAG GAC AAA CGAGCC CTC 334 Val Val Ile Ala Glu Ser Asp Gly Val Glu Glu Asp Lys Arg AlaLeu 100 105 110 GGA GCC 340 Gly Ala (2) INFORMATION FOR SEQ ID NO: 109:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 113 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 109: Ser Thr Val Thr Glu Lys Asp IleArg Val Glu Glu Glu Val Tyr Gln 1 5 10 15 Cys Cys Asp Leu Glu Pro GluAla Arg Arg Ala Ile Thr Ala Leu Thr 20 25 30 Glu Arg Leu Tyr Val Gly GlyPro Met His Asn Ser Arg Gly Asp Leu 35 40 45 Cys Gly Tyr Arg Arg Cys ArgAla Ser Gly Val Tyr Thr Thr Ser Phe 50 55 60 Gly Asn Thr Leu Thr Cys TyrLeu Lys Ala Ser Ala Ala Ile Arg Ala 65 70 75 80 Ala Gly Leu Arg Asp CysThr Met Leu Val Cys Gly Asp Asp Leu Val 85 90 95 Val Ile Ala Glu Ser AspGly Val Glu Glu Asp Lys Arg Ala Leu Gly 100 105 110 Ala (2) INFORMATIONFOR SEQ ID NO: 110: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 340 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE:NO (vii) IMMEDIATE SOURCE: (B) CLONE: GB215-3-8 (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 2..340 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:110: C TCC ACT GTA ACC GAA AAA GAC ATC AGG GTC GAG GAG GAG GTA TAT 46Ser Thr Val Thr Glu Lys Asp Ile Arg Val Glu Glu Glu Val Tyr 1 5 10 15CAG TGT TGT GAC CTG GAG CCC GAA GCC CGC AAG GTA ATT ACC GCC CTA 94 GlnCys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val Ile Thr Ala Leu 20 25 30 ACAGAG AGA CTC TAT GTG GGC GGT CCC ATG CAT AAT AGC AAA GGA GAC 142 Thr GluArg Leu Tyr Val Gly Gly Pro Met His Asn Ser Lys Gly Asp 35 40 45 CTG TGCGGG TAT CGC AGA TGC CGC GCA AGC GGC GTC TAC ACC ACC AGC 190 Leu Cys GlyTyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 50 55 60 TTC GGG AACACA CTG ACG TGC TAT CTC AAA GCC TCA GCC GCC ATC AGG 238 Phe Gly Asn ThrLeu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Ile Arg 65 70 75 GCG TCA GGG CTGAGA GAC TGC ACT ATG CTG GTC TAT GGT GAC GAC CTG 286 Ala Ser Gly Leu ArgAsp Cys Thr Met Leu Val Tyr Gly Asp Asp Leu 80 85 90 95 GTC GTC ATT GCCGAG AGC GAT GGC GTA GAG GAG GAC AAA CGA GCC CTC 334 Val Val Ile Ala GluSer Asp Gly Val Glu Glu Asp Lys Arg Ala Leu 100 105 110 GGA GTC 340 GlyVal (2) INFORMATION FOR SEQ ID NO: 111: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:Ser Thr Val Thr Glu Lys Asp Ile Arg Val Glu Glu Glu Val Tyr Gln 1 5 1015 Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val Ile Thr Ala Leu Thr 20 2530 Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Lys Gly Asp Leu 35 4045 Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 50 5560 Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Ile Arg Ala 65 7075 80 Ser Gly Leu Arg Asp Cys Thr Met Leu Val Tyr Gly Asp Asp Leu Val 8590 95 Val Ile Ala Glu Ser Asp Gly Val Glu Glu Asp Lys Arg Ala Leu Gly100 105 110 Val (2) INFORMATION FOR SEQ ID NO: 112: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:GB358-3-3 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..340 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 112: C TCC ACT GTA ACC GAA AAG GAC ATCAGG GTC GAG GAG GAG GTG TAT 46 Ser Thr Val Thr Glu Lys Asp Ile Arg ValGlu Glu Glu Val Tyr 1 5 10 15 CAG TGT TGT GAC CTG GAG CCC GAG GCC CGCAAG GCA ATT ACT GCC CTA 94 Gln Cys Cys Asp Leu Glu Pro Glu Ala Arg LysAla Ile Thr Ala Leu 20 25 30 ACA GAG AGA CTC TAT GTG GGC GGT CCC ATG CATAAC AGC AAG GGA GAC 142 Thr Glu Arg Leu Tyr Val Gly Gly Pro Met His AsnSer Lys Gly Asp 35 40 45 CTG TGT GGG TAT CGC AGA TGC CGC GCA AGC GGC GTCTAC ACC ACC AGC 190 Leu Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val TyrThr Thr Ser 50 55 60 TTC GGG AAC ACA CTG ACG TGC TAC CTC AAA GCC TCA GCCGCT ATC AGA 238 Phe Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala AlaIle Arg 65 70 75 GCG GCG GGG CTG AGA GAC TGC ACC ATG TTG GTC TGT GGT GATGAC CTG 286 Ala Ala Gly Leu Arg Asp Cys Thr Met Leu Val Cys Gly Asp AspLeu 80 85 90 95 GTC GTC ATC GCT GAG AGC GAT GGC GTT GAG GAG GAC AAA CGAGCC CTC 334 Val Val Ile Ala Glu Ser Asp Gly Val Glu Glu Asp Lys Arg AlaLeu 100 105 110 GGA GCC 340 Gly Ala (2) INFORMATION FOR SEQ ID NO: 113:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 113 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 113: Ser Thr Val Thr Glu Lys Asp IleArg Val Glu Glu Glu Val Tyr Gln 1 5 10 15 Cys Cys Asp Leu Glu Pro GluAla Arg Lys Ala Ile Thr Ala Leu Thr 20 25 30 Glu Arg Leu Tyr Val Gly GlyPro Met His Asn Ser Lys Gly Asp Leu 35 40 45 Cys Gly Tyr Arg Arg Cys ArgAla Ser Gly Val Tyr Thr Thr Ser Phe 50 55 60 Gly Asn Thr Leu Thr Cys TyrLeu Lys Ala Ser Ala Ala Ile Arg Ala 65 70 75 80 Ala Gly Leu Arg Asp CysThr Met Leu Val Cys Gly Asp Asp Leu Val 85 90 95 Val Ile Ala Glu Ser AspGly Val Glu Glu Asp Lys Arg Ala Leu Gly 100 105 110 Ala (2) INFORMATIONFOR SEQ ID NO: 114: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 340 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE:NO (vii) IMMEDIATE SOURCE: (B) CLONE: GB549-3-6 (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 2..340 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:114: C TCC ACG GTG ACC GAA AGG GAT ATC AGG ACC GAG GAA GAG ATC TAC 46Ser Thr Val Thr Glu Arg Asp Ile Arg Thr Glu Glu Glu Ile Tyr 1 5 10 15CAG TGC TGC GAC CTG GAG CCC GAA GCC CGC AAG GTG ATA TCC GCC CTA 94 GlnCys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val Ile Ser Ala Leu 20 25 30 ACGGAA AGA CTC TAC GTG GGC GGT CCC ATG TAC AAC TCC AAG GGG GAC 142 Thr GluArg Leu Tyr Val Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp 35 40 45 CTA TGCGGG CAA CGG AGG TGC CGC GCA AGC GGG GTC TAC ACC ACC AGC 190 Leu Cys GlyGln Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 50 55 60 TTC GGG AACACT GTA ACG TGT TAT CTC AAG GCC GTT GCG GCT ACT AGG 238 Phe Gly Asn ThrVal Thr Cys Tyr Leu Lys Ala Val Ala Ala Thr Arg 65 70 75 GCC GCA GGT CTGAAA GGT TGC AGC ATG CTG GTT TGT GGA GAC GAC TTA 286 Ala Ala Gly Leu LysGly Cys Ser Met Leu Val Cys Gly Asp Asp Leu 80 85 90 95 GTC GTC ATC TGCGAG AGC GGC GGC GTA GAG GAG GAT GCA AGA GCC CTC 334 Val Val Ile Cys GluSer Gly Gly Val Glu Glu Asp Ala Arg Ala Leu 100 105 110 CGA GCC 340 ArgAla (2) INFORMATION FOR SEQ ID NO: 115: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:Ser Thr Val Thr Glu Arg Asp Ile Arg Thr Glu Glu Glu Ile Tyr Gln 1 5 1015 Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val Ile Ser Ala Leu Thr 20 2530 Glu Arg Leu Tyr Val Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp Leu 35 4045 Cys Gly Gln Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 50 5560 Gly Asn Thr Val Thr Cys Tyr Leu Lys Ala Val Ala Ala Thr Arg Ala 65 7075 80 Ala Gly Leu Lys Gly Cys Ser Met Leu Val Cys Gly Asp Asp Leu Val 8590 95 Val Ile Cys Glu Ser Gly Gly Val Glu Glu Asp Ala Arg Ala Leu Arg100 105 110 Ala (2) INFORMATION FOR SEQ ID NO: 116: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE:GB809-3-1 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..340 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 116: C TCC ACT GTG ACT GAG AGA GAC ATCAAG GTC GAA GAA GAA GTC TAT 46 Ser Thr Val Thr Glu Arg Asp Ile Lys ValGlu Glu Glu Val Tyr 1 5 10 15 CAG TGT TGT GAT CTG GAG CCC GAG GCC CGCAAG GTA ATA GCC GCC CTC 94 Gln Cys Cys Asp Leu Glu Pro Glu Ala Arg LysVal Ile Ala Ala Leu 20 25 30 ACG GAG AGA CTC TAC GTG GGC GGC CCC ATG CATAAC AGC AAG GGA GAC 142 Thr Glu Arg Leu Tyr Val Gly Gly Pro Met His AsnSer Lys Gly Asp 35 40 45 CTT TGC GGG TAT CGT AGA TGC CGC GCG AGC GGC GTATAC ACC ACC AGC 190 Leu Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val TyrThr Thr Ser 50 55 60 TTC GGG AAC ACA ATG ACG TGC TAC CTT AAG GCC TCA GCAGCC ATC AGG 238 Phe Gly Asn Thr Met Thr Cys Tyr Leu Lys Ala Ser Ala AlaIle Arg 65 70 75 GCT GCG GGG CTA AAG GAT TGC ACC ATG CTG GTT TGC GGT GACGAC CTA 286 Ala Ala Gly Leu Lys Asp Cys Thr Met Leu Val Cys Gly Asp AspLeu 80 85 90 95 GTC GTG ATC GCC GAG AGC GGT GGC GTT GAG GAG GAC AAA CGAGCC CTC 334 Val Val Ile Ala Glu Ser Gly Gly Val Glu Glu Asp Lys Arg AlaLeu 100 105 110 GGA GCT 340 Gly Ala (2) INFORMATION FOR SEQ ID NO: 117:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 113 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 117: Ser Thr Val Thr Glu Arg Asp IleLys Val Glu Glu Glu Val Tyr Gln 1 5 10 15 Cys Cys Asp Leu Glu Pro GluAla Arg Lys Val Ile Ala Ala Leu Thr 20 25 30 Glu Arg Leu Tyr Val Gly GlyPro Met His Asn Ser Lys Gly Asp Leu 35 40 45 Cys Gly Tyr Arg Arg Cys ArgAla Ser Gly Val Tyr Thr Thr Ser Phe 50 55 60 Gly Asn Thr Met Thr Cys TyrLeu Lys Ala Ser Ala Ala Ile Arg Ala 65 70 75 80 Ala Gly Leu Lys Asp CysThr Met Leu Val Cys Gly Asp Asp Leu Val 85 90 95 Val Ile Ala Glu Ser GlyGly Val Glu Glu Asp Lys Arg Ala Leu Gly 100 105 110 Ala (2) INFORMATIONFOR SEQ ID NO: 118: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 574 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE:NO (vii) IMMEDIATE SOURCE: (B) CLONE: GB358-4-1 (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..574 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: ACT TGC GGC TTT GCC GAC CTC ATG GGA TAC ATC CCG CTC GTA GGC GCC 48Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 CCT GTG GGT GGC GTC GCC AGG GCC CTG GCA CAC GGT GTT AGG GCT GTG 96Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30GAG GAC GGG ATC AAT TAT GCG ACA GGG AAT CTT CCC GGT TGC TCT TTC 144 GluAsp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TTC CTC TTG GCA CTT CTT TCG TGC CTG ACT GTT CCC ACC TCG 192 Ser IlePhe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 50 55 60 GCC GTCAAC TAT CGC AAT GCC TCG GGC ATC TAT CAC ATC ACC AAT GAC 240 Ala Val AsnTyr Arg Asn Ala Ser Gly Ile Tyr His Ile Thr Asn Asp 65 70 75 80 TGC CCGAAC TCG AGC ATA GTG TAC GAG ACC GAG CAC CAC ATC CTA CAC 288 Cys Pro AsnSer Ser Ile Val Tyr Glu Thr Glu His His Ile Leu His 85 90 95 CTC CCA GGGTGT TTA CCC TGC GTG AGG GTT GGG AAT CAG TCA CGC TGC 336 Leu Pro Gly CysLeu Pro Cys Val Arg Val Gly Asn Gln Ser Arg Cys 100 105 110 TGG GTG GCCCTC ACT CCC ACC GTG GCG GCG CCT TAC ATC GGC GCT CCG 384 Trp Val Ala LeuThr Pro Thr Val Ala Ala Pro Tyr Ile Gly Ala Pro 115 120 125 CTT GAA TCCCTC CGG AGT CAT GTG GAT CTG ATG GTA GGT GCC GCT ACT 432 Leu Glu Ser LeuArg Ser His Val Asp Leu Met Val Gly Ala Ala Thr 130 135 140 GCG TGC TCCGCT CTT TAC ATC GGA GAC CTG TGC GGT GGC GTA TTC TTG 480 Ala Cys Ser AlaLeu Tyr Ile Gly Asp Leu Cys Gly Gly Val Phe Leu 145 150 155 160 GTT GGTCAG ATG TTC TCT TTC CAG CCG CGG CGC CAC TGG ACT ACG CAG 528 Val Gly GlnMet Phe Ser Phe Gln Pro Arg Arg His Trp Thr Thr Gln 165 170 175 GAC TGCAAT TGT TCC ATC TAC GCG GGG CAC GTT ACG GGC CAC AGG A 574 Asp Cys AsnCys Ser Ile Tyr Ala Gly His Val Thr Gly His Arg 180 185 190 (2)INFORMATION FOR SEQ ID NO: 119: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 191 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: ThrCys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 10 15Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 50 55 60Ala Val Asn Tyr Arg Asn Ala Ser Gly Ile Tyr His Ile Thr Asn Asp 65 70 7580 Cys Pro Asn Ser Ser Ile Val Tyr Glu Thr Glu His His Ile Leu His 85 9095 Leu Pro Gly Cys Leu Pro Cys Val Arg Val Gly Asn Gln Ser Arg Cys 100105 110 Trp Val Ala Leu Thr Pro Thr Val Ala Ala Pro Tyr Ile Gly Ala Pro115 120 125 Leu Glu Ser Leu Arg Ser His Val Asp Leu Met Val Gly Ala AlaThr 130 135 140 Ala Cys Ser Ala Leu Tyr Ile Gly Asp Leu Cys Gly Gly ValPhe Leu 145 150 155 160 Val Gly Gln Met Phe Ser Phe Gln Pro Arg Arg HisTrp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Ala Gly His ValThr Gly His Arg 180 185 190 (2) INFORMATION FOR SEQ ID NO: 120: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 574 base pairs (B) TYPE: nucleicacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE:(B) CLONE: GB549-4-3 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION:1..574 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: ACG TGC GGC TTT GCCGAC CTC ATG GGA TAC ATC CCG CTC GTG GGC GCC 48 Thr Cys Gly Phe Ala AspLeu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 10 15 CCT GTG GGT GGC GTCGCC AGG GCC TTG GCA CAT GGT GTC AGG GCC GTG 96 Pro Val Gly Gly Val AlaArg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30 GAG GAC GGG ATT AAC TATGCA ACA GGG AAT CTT CCC GGT TGC TCC TTT 144 Glu Asp Gly Ile Asn Tyr AlaThr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCT ATC TTC CTT CTA GCA CTTCTC TCG TGC TTG ACT GTC CCG GCC TCG 192 Ser Ile Phe Leu Leu Ala Leu LeuSer Cys Leu Thr Val Pro Ala Ser 50 55 60 GCG CAG CAC TAC CGG AAC ATC TCGGGC ATT TAT CAC GTC ACC AAT GAC 240 Ala Gln His Tyr Arg Asn Ile Ser GlyIle Tyr His Val Thr Asn Asp 65 70 75 80 TGC CCG AAC TCT AGT ATA GTG TATGAA GCT GAC CAT CAT ATC ATG CAT 288 Cys Pro Asn Ser Ser Ile Val Tyr GluAla Asp His His Ile Met His 85 90 95 CTA CCA GGG TGT GTG CCT TGC GTG AGAACC GGG AAC ACC TCG CGC TGC 336 Leu Pro Gly Cys Val Pro Cys Val Arg ThrGly Asn Thr Ser Arg Cys 100 105 110 TGG GTT CCT TTA ACA CCC ACT GTG GCTGCC CCC TAT GTT GGC GCG CCG 384 Trp Val Pro Leu Thr Pro Thr Val Ala AlaPro Tyr Val Gly Ala Pro 115 120 125 CTC GAA TCC ATG CGG CGG CAC GTG GACTTA ATG GTG GGT GCC GCC ACC 432 Leu Glu Ser Met Arg Arg His Val Asp LeuMet Val Gly Ala Ala Thr 130 135 140 GTC TGC TCG GCC CTG TAC ATC GGA GACCTT TGC GGA GGT GTC TTC CTG 480 Val Cys Ser Ala Leu Tyr Ile Gly Asp LeuCys Gly Gly Val Phe Leu 145 150 155 160 GTC GGG CAG ATG TTC ACC TTC CGGCCG CGC CGC CAT TGG ACT ACC CAG 528 Val Gly Gln Met Phe Thr Phe Arg ProArg Arg His Trp Thr Thr Gln 165 170 175 GAC TGC AAC TGC TCT ATC TAT GATGGC CAC ATC ACC GGC CAT AGA A 574 Asp Cys Asn Cys Ser Ile Tyr Asp GlyHis Ile Thr Gly His Arg 180 185 190 (2) INFORMATION FOR SEQ ID NO: 121:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 191 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 121: Thr Cys Gly Phe Ala Asp Leu MetGly Tyr Ile Pro Leu Val Gly Ala 1 5 10 15 Pro Val Gly Gly Val Ala ArgAla Leu Ala His Gly Val Arg Ala Val 20 25 30 Glu Asp Gly Ile Asn Tyr AlaThr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 Ser Ile Phe Leu Leu Ala LeuLeu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60 Ala Gln His Tyr Arg Asn IleSer Gly Ile Tyr His Val Thr Asn Asp 65 70 75 80 Cys Pro Asn Ser Ser IleVal Tyr Glu Ala Asp His His Ile Met His 85 90 95 Leu Pro Gly Cys Val ProCys Val Arg Thr Gly Asn Thr Ser Arg Cys 100 105 110 Trp Val Pro Leu ThrPro Thr Val Ala Ala Pro Tyr Val Gly Ala Pro 115 120 125 Leu Glu Ser MetArg Arg His Val Asp Leu Met Val Gly Ala Ala Thr 130 135 140 Val Cys SerAla Leu Tyr Ile Gly Asp Leu Cys Gly Gly Val Phe Leu 145 150 155 160 ValGly Gln Met Phe Thr Phe Arg Pro Arg Arg His Trp Thr Thr Gln 165 170 175Asp Cys Asn Cys Ser Ile Tyr Asp Gly His Ile Thr Gly His Arg 180 185 190(2) INFORMATION FOR SEQ ID NO: 122: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 574 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO(iii) ANTI-SENSE: NO (vii) IMMEDIATE SOURCE: (B) CLONE: GB809-4-3 (ix)FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..574 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 122: ACG TGC GGC TTC GCC GAC CTC ATG GGA TAC ATCCCG CTC GTG GGC GCC 48 Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile ProLeu Val Gly Ala 1 5 10 15 CCC GTT GGG GGC GTC GCC AGG GCC CTG GCG CATGGC GTC AGG GCT GTG 96 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His GlyVal Arg Ala Val 20 25 30 GAG GAC GGG ATT AAC TAT GCG ACA GGG AAT CTT CCCGGT TGC TCT TTC 144 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro GlyCys Ser Phe 35 40 45 TCT ATC TTC CTC CTG GCA CTT CTT TCG TGC CTC ACT GTCCCA GCG TCA 192 Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val ProAla Ser 50 55 60 GCT GAG CAC TAC CGG AAT GCT TCG GGC ATC TAT CAC ATC ACCAAT GAC 240 Ala Glu His Tyr Arg Asn Ala Ser Gly Ile Tyr His Ile Thr AsnAsp 65 70 75 80 TGT CCG AAT TCC AGC GTA GTC TAT GAA ACT GAC CAC CAT ATATTG CAC 288 Cys Pro Asn Ser Ser Val Val Tyr Glu Thr Asp His His Ile LeuHis 85 90 95 TTG CCG GGG TGC GTA CCC TGC GTG AGG GCC GGG AAC GTG TCT CGTTGC 336 Leu Pro Gly Cys Val Pro Cys Val Arg Ala Gly Asn Val Ser Arg Cys100 105 110 TGG ACG CCG GTA ACA CCT ACG GTG GCT GCC GTA TCC ATG GAC GCTCCG 384 Trp Thr Pro Val Thr Pro Thr Val Ala Ala Val Ser Met Asp Ala Pro115 120 125 CTC GAG TCC TTC CGG CGG CAT GTG GAC CTA ATG GTA GGT GCG GCCACC 432 Leu Glu Ser Phe Arg Arg His Val Asp Leu Met Val Gly Ala Ala Thr130 135 140 GTG TGT TCT GTC CTC TAT GTT GGA GAC CTC TGT GGA GGT GCT TTCCTA 480 Val Cys Ser Val Leu Tyr Val Gly Asp Leu Cys Gly Gly Ala Phe Leu145 150 155 160 GTG GGG CAG ATG TTC ACC TTC CAG CCG CGT CGC CAC TGG ACCACG CAG 528 Val Gly Gln Met Phe Thr Phe Gln Pro Arg Arg His Trp Thr ThrGln 165 170 175 GAT TGT AAT TGC TCC ATC TAT ACT GGC CAT ATC ACC GGC CACAGG A 574 Asp Cys Asn Cys Ser Ile Tyr Thr Gly His Ile Thr Gly His Arg180 185 190 (2) INFORMATION FOR SEQ ID NO: 123: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 191 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 123: Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu ValGly Ala 1 5 10 15 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly ValArg Ala Val 20 25 30 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro GlyCys Ser Phe 35 40 45 Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr ValPro Ala Ser 50 55 60 Ala Glu His Tyr Arg Asn Ala Ser Gly Ile Tyr His IleThr Asn Asp 65 70 75 80 Cys Pro Asn Ser Ser Val Val Tyr Glu Thr Asp HisHis Ile Leu His 85 90 95 Leu Pro Gly Cys Val Pro Cys Val Arg Ala Gly AsnVal Ser Arg Cys 100 105 110 Trp Thr Pro Val Thr Pro Thr Val Ala Ala ValSer Met Asp Ala Pro 115 120 125 Leu Glu Ser Phe Arg Arg His Val Asp LeuMet Val Gly Ala Ala Thr 130 135 140 Val Cys Ser Val Leu Tyr Val Gly AspLeu Cys Gly Gly Ala Phe Leu 145 150 155 160 Val Gly Gln Met Phe Thr PheGln Pro Arg Arg His Trp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser IleTyr Thr Gly His Ile Thr Gly His Arg 180 185 190 (2) INFORMATION FOR SEQID NO: 124: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 31 base pairs (B)TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO(ix) FEATURE: (A) NAME/KEY: misc_feature (B) LOCATION: 1..31 (D) OTHERINFORMATION: /standard_name= “HCV Primer HCPr206” (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 124: TGGGGATCCC GTATGATACC CGCTGCTTTG A 31 (2)INFORMATION FOR SEQ ID NO: 125: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 30 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA (genomic) (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: YES (ix) FEATURE: (A) NAME/KEY:misc_feature (B) LOCATION: 1..30 (D) OTHER INFORMATION: /standard_name=“HCV Primer HcPr207” (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:GGCGGAATTC CTGGTCATAG CCTCCGTGAA 30 (2) INFORMATION FOR SEQ ID NO: 126:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: aminoacid (C) INDIVIDUAL ISOLATE: GB358 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: Val Asn Tyr Arg Asn Ala Ser Gly Ile Tyr His Ile 1 5 10 (2)INFORMATION FOR SEQ ID NO: 127: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 12 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (vi)ORIGINAL SOURCE: (A) ORGANISM: Amino acid (C) INDIVIDUAL ISOLATE: GB549(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: Gln His Tyr Arg Asn Ile SerGly Ile Tyr His Val 1 5 10 (2) INFORMATION FOR SEQ ID NO: 128: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: Aminoacid (C) INDIVIDUAL ISOLATE: GB809 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:128: Glu His Tyr Arg Asn Ala Ser Gly Ile Tyr His Ile 1 5 10 (2)INFORMATION FOR SEQ ID NO: 129: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 11 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (vi)ORIGINAL SOURCE: (A) ORGANISM: amino acid (C) INDIVIDUAL ISOLATE: GB358(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: Val Tyr Glu Thr Glu His HisIle Leu His Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 130: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: amino acid(C) INDIVIDUAL ISOLATE: GB549 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:Val Tyr Glu Ala Asp His His Ile Met His Leu 1 5 10 (2) INFORMATION FORSEQ ID NO: 131: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 11 amino acids(B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (A)ORGANISM: amino acid (C) INDIVIDUAL ISOLATE: GB809 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 131: Val Tyr Glu Thr Asp His His Ile Leu His Leu1 5 10 (2) INFORMATION FOR SEQ ID NO: 132: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 13 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO(vi) ORIGINAL SOURCE: (A) ORGANISM: amino acid (C) INDIVIDUAL ISOLATE:GB358 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: Val Arg Val Gly Asn GlnSer Arg Cys Trp Val Ala Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 133:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: aminoacid (C) INDIVIDUAL ISOLATE: GB549 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: Val Arg Thr Gly Asn Thr Ser Arg Cys Trp Val Pro Leu 1 5 10 (2)INFORMATION FOR SEQ ID NO: 134: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 13 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (vi)ORIGINAL SOURCE: (A) ORGANISM: amino acid (C) INDIVIDUAL ISOLATE: GB809(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: Val Arg Ala Gly Asn Val SerArg Cys Trp Thr Pro Val 1 5 10 (2) INFORMATION FOR SEQ ID NO: 135: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 10 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: aminoacid (C) INDIVIDUAL ISOLATE: GB358 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:135: Ala Pro Tyr Ile Gly Ala Pro Leu Glu Ser 1 5 10 (2) INFORMATION FORSEQ ID NO: 136: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 10 amino acids(B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (A)ORGANISM: amino acid (C) INDIVIDUAL ISOLATE: GB549 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 136: Ala Pro Tyr Val Gly Ala Pro Leu Glu Ser 1 510 (2) INFORMATION FOR SEQ ID NO: 137: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (vi)ORIGINAL SOURCE: (A) ORGANISM: amino acid (C) INDIVIDUAL ISOLATE: GB809(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: Ala Val Ser Met Asp Ala ProLeu Glu Ser 1 5 10 (2) INFORMATION FOR SEQ ID NO: 138: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 10 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: amino acid(C) INDIVIDUAL ISOLATE: GB358 and GB809 (xi) SEQUENCE DESCRIPTION: SEQID NO: 138: Gln Pro Arg Arg His Trp Thr Thr Gln Asp 1 5 10 (2)INFORMATION FOR SEQ ID NO: 139: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (vi)ORIGINAL SOURCE: (A) ORGANISM: amino acid (C) INDIVIDUAL ISOLATE: GB549(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: Arg Pro Arg Arg His Trp ThrThr Gln Asp 1 5 10 (2) INFORMATION FOR SEQ ID NO: 140: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 10 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(iii) HYPOTHETICAL: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: amino acid(C) INDIVIDUAL ISOLATE: GB549 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:Arg Pro Arg Arg His Trp Thr Thr Gln Asp 1 5 10 (2) INFORMATION FOR SEQID NO: 141: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs (B)TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 141: TGGGATATGA TGATGAACTG GTC 23 (2)INFORMATION FOR SEQ ID NO: 142: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO(iii) ANTI-SENSE: YES (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142:CCAGGTACAA CCGAACCAAT TGCC 24 (2) INFORMATION FOR SEQ ID NO: 143: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 957 base pairs (B) TYPE: nucleicacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..957 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..954 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:143: ATG AGC ACA AAT CCT AAA CCT CAA AGA AAA ACC AAA AGA AAC ACT AAC 48Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 1 5 1015 CGC CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGC CAG ATC GTT GGT 96Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30GGA GTA TAC TTG TTG CCG CGC AGG GGC CCC CGG TTG GGT GTG CGC GCG 144 GlyVal Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 35 40 45 ACGAGG AAA ACT TCC GAG CGG TCC CAG CCA CGT GGG AGG CGC CAG CCC 192 Thr ArgLys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 50 55 60 ATC CCCAAA GAT CGG CGC CCC ACT GGC AAG TCC TGG GGA AAA CCA GGA 240 Ile Pro LysAsp Arg Arg Pro Thr Gly Lys Ser Trp Gly Lys Pro Gly 65 70 75 80 TAC CCTTGG CCC CTG TAC GGG AAT GAG GGC CTC GGC TGG GCA GGG TGG 288 Tyr Pro TrpPro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 85 90 95 CTC CTG TCCCCC CGA GGG TCT CGC CCG TCA TGG GGC CCA ACT GAC CCC 336 Leu Leu Ser ProArg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 100 105 110 CGG CAC AGGTCA CGC AAC TTG GGT AAG GTC ATC GAT ACC CTT ACG TGT 384 Arg His Arg SerArg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys 115 120 125 GGC TTT GCCGAC CTC ATG GGG TAC ATC CCT GTC GTC GGC GCC CCA GTT 432 Gly Phe Ala AspLeu Met Gly Tyr Ile Pro Val Val Gly Ala Pro Val 130 135 140 GGT GGT GTCGCC AGA GCT CTC GCG CAT GGC GTG AGA GTT CTG GAA GAC 480 Gly Gly Val AlaArg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 145 150 155 160 GGG ATAAAC TAT GCA ACA GGG AAC TTG CCC GGT TGC TCC TTT TCT ATC 528 Gly Ile AsnTyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile 165 170 175 TTC TTATTG GCC CTG CTA TCT TGT ATC ACT GTG CCG GTC TCC GGC TTG 576 Phe Leu LeuAla Leu Leu Ser Cys Ile Thr Val Pro Val Ser Gly Leu 180 185 190 CAG GTCAAG AAC ACC AGC AGC TCT TAC ATG GTA ACC AAT GAC TGC CAG 624 Gln Val LysAsn Thr Ser Ser Ser Tyr Met Val Thr Asn Asp Cys Gln 195 200 205 AAC AGTAGC ATC GTC TGG CAG CTC AGG GAT GCT GTT CTT CAC GTC CCC 672 Asn Ser SerIle Val Trp Gln Leu Arg Asp Ala Val Leu His Val Pro 210 215 220 GGG TGTGTC CCT TGT GAG GAG AAG GGC AAC ATA TCC CGC TGT TGG ATA 720 Gly Cys ValPro Cys Glu Glu Lys Gly Asn Ile Ser Arg Cys Trp Ile 225 230 235 240 CCGGTT TCG CCC AAT ATA GCT GTG AGC CAA CCT GGT GCG CTT ACC AAG 768 Pro ValSer Pro Asn Ile Ala Val Ser Gln Pro Gly Ala Leu Thr Lys 245 250 255 GGCCTG CGG ACG CAT ATT GAT ACC ATC ATT GCA TCC GCT ACG TTT TGC 816 Gly LeuArg Thr His Ile Asp Thr Ile Ile Ala Ser Ala Thr Phe Cys 260 265 270 TCTGCC CTG TAC ATA GGA GAC CTG TGT GGC GCG GTG ATG TTG GCT TCT 864 Ser AlaLeu Tyr Ile Gly Asp Leu Cys Gly Ala Val Met Leu Ala Ser 275 280 285 CAAGTC TTC ATC ATC TCG CCC CAG CAT CAT AAG TTT GTC CAG GAC TGC 912 Gln ValPhe Ile Ile Ser Pro Gln His His Lys Phe Val Gln Asp Cys 290 295 300 AACTGT TCC ATA TAC CCA GGC CAC ATC ACT GGA CAT CGG ATG GCG 957 Asn Cys SerIle Tyr Pro Gly His Ile Thr Gly His Arg Met Ala 305 310 315 (2)INFORMATION FOR SEQ ID NO: 144: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 319 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: MetSer Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 1 5 10 15Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 35 40 45Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 50 55 60Ile Pro Lys Asp Arg Arg Pro Thr Gly Lys Ser Trp Gly Lys Pro Gly 65 70 7580 Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 85 9095 Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 100105 110 Arg His Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys115 120 125 Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Val Val Gly Ala ProVal 130 135 140 Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val LeuGlu Asp 145 150 155 160 Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly CysSer Phe Ser Ile 165 170 175 Phe Leu Leu Ala Leu Leu Ser Cys Ile Thr ValPro Val Ser Gly Leu 180 185 190 Gln Val Lys Asn Thr Ser Ser Ser Tyr MetVal Thr Asn Asp Cys Gln 195 200 205 Asn Ser Ser Ile Val Trp Gln Leu ArgAsp Ala Val Leu His Val Pro 210 215 220 Gly Cys Val Pro Cys Glu Glu LysGly Asn Ile Ser Arg Cys Trp Ile 225 230 235 240 Pro Val Ser Pro Asn IleAla Val Ser Gln Pro Gly Ala Leu Thr Lys 245 250 255 Gly Leu Arg Thr HisIle Asp Thr Ile Ile Ala Ser Ala Thr Phe Cys 260 265 270 Ser Ala Leu TyrIle Gly Asp Leu Cys Gly Ala Val Met Leu Ala Ser 275 280 285 Gln Val PheIle Ile Ser Pro Gln His His Lys Phe Val Gln Asp Cys 290 295 300 Asn CysSer Ile Tyr Pro Gly His Ile Thr Gly His Arg Met Ala 305 310 315 (2)INFORMATION FOR SEQ ID NO: 145: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 340 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO(iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 2..337 (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 2..340(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: C TCA ACG GTC ACG GAG AGG GACATC AGA ACT GAG GAG TCC ATA TAC 46 Ser Thr Val Thr Glu Arg Asp Ile ArgThr Glu Glu Ser Ile Tyr 1 5 10 15 CTT GCT TGC TCT TTA CCC GAG CAG GCACGG ACT GCC ATA CAC TCA CTG 94 Leu Ala Cys Ser Leu Pro Glu Gln Ala ArgThr Ala Ile His Ser Leu 20 25 30 ACT GAG AGG CTT TAC GTG GGA GGG CCC ATGCTA AAC AGC AAA GGG CAA 142 Thr Glu Arg Leu Tyr Val Gly Gly Pro Met LeuAsn Ser Lys Gly Gln 35 40 45 ACC TGC GGA TAC AGA CGC TGC CGC GCC AGC GGAGTG TTC ACC ACT AGC 190 Thr Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly ValPhe Thr Thr Ser 50 55 60 ATG GGA AAT ACC ATC ACG TGC TAC GTG AAG GCA CAAGCA GCC TGT AAG 238 Met Gly Asn Thr Ile Thr Cys Tyr Val Lys Ala Gln AlaAla Cys Lys 65 70 75 GCT GCG GGC ATA ATT GCC CCC ACG ATG CTG GTG TGC GGCGAC GAT CTA 286 Ala Ala Gly Ile Ile Ala Pro Thr Met Leu Val Cys Gly AspAsp Leu 80 85 90 95 GTT GTC ATC TCA GAG AGT CAG GGG ACC GAG GAG GAC GAGCGG AAC CTA 334 Val Val Ile Ser Glu Ser Gln Gly Thr Glu Glu Asp Glu ArgAsn Leu 100 105 110 CGA GCC 340 Arg Ala (2) INFORMATION FOR SEQ ID NO:146: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 113 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 146: Ser Thr Val Thr Glu Arg Asp IleArg Thr Glu Glu Ser Ile Tyr Leu 1 5 10 15 Ala Cys Ser Leu Pro Glu GlnAla Arg Thr Ala Ile His Ser Leu Thr 20 25 30 Glu Arg Leu Tyr Val Gly GlyPro Met Leu Asn Ser Lys Gly Gln Thr 35 40 45 Cys Gly Tyr Arg Arg Cys ArgAla Ser Gly Val Phe Thr Thr Ser Met 50 55 60 Gly Asn Thr Ile Thr Cys TyrVal Lys Ala Gln Ala Ala Cys Lys Ala 65 70 75 80 Ala Gly Ile Ile Ala ProThr Met Leu Val Cys Gly Asp Asp Leu Val 85 90 95 Val Ile Ser Glu Ser GlnGly Thr Glu Glu Asp Glu Arg Asn Leu Arg 100 105 110 Ala (2) INFORMATIONFOR SEQ ID NO: 147: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 346 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE:NO (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..346 (ix) FEATURE:(A) NAME/KEY: mat_peptide (B) LOCATION: 1..342 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 147: ATG AGC ACA CTT CCT AAA CCA CAA AGA AAA ACCAAA AGA AAC ACC AAC 48 Met Ser Thr Leu Pro Lys Pro Gln Arg Lys Thr LysArg Asn Thr Asn 1 5 10 15 CSC CGG CCA CAG GAC GTT AAG TTC CCA GGC GGCGGT CAG ATC GTT GGT 96 Xaa Arg Pro Gln Asp Val Lys Phe Pro Gly Gly GlyGln Ile Val Gly 20 25 30 GGA GTT TAC GTG CTA CCA CGC AGG GGC CCC CAG TTGGGT GTG CGT GCA 144 Gly Val Tyr Val Leu Pro Arg Arg Gly Pro Gln Leu GlyVal Arg Ala 35 40 45 GTG CGC AAG ACT TCC GAG CGG TCG CAA CCT CGC AGT AGGCGC CAA CCC 192 Val Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Ser Arg ArgGln Pro 50 55 60 ATC CCC AGG GCG CGC CGA ACC GAG GGC AGG TCC TGG GCT CAGCCC GGG 240 Ile Pro Arg Ala Arg Arg Thr Glu Gly Arg Ser Trp Ala Gln ProGly 65 70 75 80 TAC CCT TGG CCC CTA TAT GGG AAT GAG GGC TGC GGG TGG GCAGGG TGG 288 Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala GlyTrp 85 90 95 CTC CTG TCC CCG CGC GGC TCT CGC CCG TCG TGG GGC CCA AAT GACCCC 336 Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro100 105 110 CGG CGC AGG A 346 Arg Arg Arg 115 (2) INFORMATION FOR SEQ IDNO: 148: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 115 amino acids (B)TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 148: Met Ser Thr Leu Pro Lys Pro GlnArg Lys Thr Lys Arg Asn Thr Asn 1 5 10 15 Xaa Arg Pro Gln Asp Val LysPhe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30 Gly Val Tyr Val Leu Pro ArgArg Gly Pro Gln Leu Gly Val Arg Ala 35 40 45 Val Arg Lys Thr Ser Glu ArgSer Gln Pro Arg Ser Arg Arg Gln Pro 50 55 60 Ile Pro Arg Ala Arg Arg ThrGlu Gly Arg Ser Trp Ala Gln Pro Gly 65 70 75 80 Tyr Pro Trp Pro Leu TyrGly Asn Glu Gly Cys Gly Trp Ala Gly Trp 85 90 95 Leu Leu Ser Pro Arg GlySer Arg Pro Ser Trp Gly Pro Asn Asp Pro 100 105 110 Arg Arg Arg 115 (2)INFORMATION FOR SEQ ID NO: 149: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 280 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 2..280 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 2..277 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:149: G GCC TGT GAC CTC AAG GAC GAG GCT AGG AGG GTG ATA ACT TCA CTC 46Ala Cys Asp Leu Lys Asp Glu Ala Arg Arg Val Ile Thr Ser Leu 1 5 10 15ACG GAG CGG CTT TAC TGT GGT GGT CCT ATG TTC AAC AGC AAG GGA CAA 94 ThrGlu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly Gln 20 25 30 CACTGC GGT TAC CGC CGC TGC CGT GCT AGT GGG GTG CTA CCC ACC AGC 142 His CysGly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr Ser 35 40 45 TTC GGGAAC ACA ATC ACC TGT TAC ATC AAA GCA AAG GCA GCT ACC AAA 190 Phe Gly AsnThr Ile Thr Cys Tyr Ile Lys Ala Lys Ala Ala Thr Lys 50 55 60 GCT GCC GGAATT AAA AAT CCA TCA TTC CTT GTC TGC GGA GAT GAC TTG 238 Ala Ala Gly IleLys Asn Pro Ser Phe Leu Val Cys Gly Asp Asp Leu 65 70 75 GTC GTG ATT GCTGAG AGT GCA GGG ATC GAT GAG GAC AGA GCG 280 Val Val Ile Ala Glu Ser AlaGly Ile Asp Glu Asp Arg Ala 80 85 90 (2) INFORMATION FOR SEQ ID NO: 150:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 93 amino acids (B) TYPE: aminoacid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 150: Ala Cys Asp Leu Lys Asp Glu Ala Arg Arg ValIle Thr Ser Leu Thr 1 5 10 15 Glu Arg Leu Tyr Cys Gly Gly Pro Met PheAsn Ser Lys Gly Gln His 20 25 30 Cys Gly Tyr Arg Arg Cys Arg Ala Ser GlyVal Leu Pro Thr Ser Phe 35 40 45 Gly Asn Thr Ile Thr Cys Tyr Ile Lys AlaLys Ala Ala Thr Lys Ala 50 55 60 Ala Gly Ile Lys Asn Pro Ser Phe Leu ValCys Gly Asp Asp Leu Val 65 70 75 80 Val Ile Ala Glu Ser Ala Gly Ile AspGlu Asp Arg Ala 85 90 (2) INFORMATION FOR SEQ ID NO: 151: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 499 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 1..499 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 1..496 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: ATG AGC ACGAAT CCT AAA CCT CAA AGA AAA ACC AAA AGA AAC ACC AAC 48 Met Ser Thr AsnPro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 1 5 10 15 CGT CGC CCACAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC GTT GGC 96 Arg Arg Pro GlnAsp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30 GGA GTT TAC TTGTTG CCG CGC AGG GGC CCT AGG ATG GGT GTG CGC GCG 144 Gly Val Tyr Leu LeuPro Arg Arg Gly Pro Arg Met Gly Val Arg Ala 35 40 45 ACT CGG AAG ACT TCGGAA CGG TCG CAA CCC CGT GGA CGG CGT CAG CCT 192 Thr Arg Lys Thr Ser GluArg Ser Gln Pro Arg Gly Arg Arg Gln Pro 50 55 60 ATT CCC AAG GCG CGC CAGCCC ACG GGC CGG TCC TGG GGT CAA CCC GGG 240 Ile Pro Lys Ala Arg Gln ProThr Gly Arg Ser Trp Gly Gln Pro Gly 65 70 75 80 TAC CCT TGG CCC CTT TACGCC AAT GAG GGC CTC GGG TGG GCA GGG TGG 288 Tyr Pro Trp Pro Leu Tyr AlaAsn Glu Gly Leu Gly Trp Ala Gly Trp 85 90 95 CTG CTC TCC CCT CGA GGC TCTCGG CCT AAT TGG GGC CCC AAT GAC CCC 336 Leu Leu Ser Pro Arg Gly Ser ArgPro Asn Trp Gly Pro Asn Asp Pro 100 105 110 CGG CGA AAA TCG CGT AAT TTGGGT AAG GTC ATC GAT ACC CTA ACG TGC 384 Arg Arg Lys Ser Arg Asn Leu GlyLys Val Ile Asp Thr Leu Thr Cys 115 120 125 GGA TTC GCC GAT CTC ATG GGGTAT ATC CCG CTC GTA GGC GGC CCC ATT 432 Gly Phe Ala Asp Leu Met Gly TyrIle Pro Leu Val Gly Gly Pro Ile 130 135 140 GGG GGC GTC GCA AGG GCT CTCGCA CAC GGT GTG AGG GTC CTT GAG GAC 480 Gly Gly Val Ala Arg Ala Leu AlaHis Gly Val Arg Val Leu Glu Asp 145 150 155 160 GGG GTA AAC TAT GCA ACAG 499 Gly Val Asn Tyr Ala Thr 165 (2) INFORMATION FOR SEQ ID NO: 152:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 166 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 152: Met Ser Thr Asn Pro Lys Pro GlnArg Lys Thr Lys Arg Asn Thr Asn 1 5 10 15 Arg Arg Pro Gln Asp Val LysPhe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30 Gly Val Tyr Leu Leu Pro ArgArg Gly Pro Arg Met Gly Val Arg Ala 35 40 45 Thr Arg Lys Thr Ser Glu ArgSer Gln Pro Arg Gly Arg Arg Gln Pro 50 55 60 Ile Pro Lys Ala Arg Gln ProThr Gly Arg Ser Trp Gly Gln Pro Gly 65 70 75 80 Tyr Pro Trp Pro Leu TyrAla Asn Glu Gly Leu Gly Trp Ala Gly Trp 85 90 95 Leu Leu Ser Pro Arg GlySer Arg Pro Asn Trp Gly Pro Asn Asp Pro 100 105 110 Arg Arg Lys Ser ArgAsn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys 115 120 125 Gly Phe Ala AspLeu Met Gly Tyr Ile Pro Leu Val Gly Gly Pro Ile 130 135 140 Gly Gly ValAla Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 145 150 155 160 GlyVal Asn Tyr Ala Thr 165 (2) INFORMATION FOR SEQ ID NO: 153: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 1..576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: ACG TGC GGATTC GCC GAT CTC ATG GGG TAC ATC CCG CTC GTA GGC GGC 48 Thr Cys Gly PheAla Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Gly 1 5 10 15 CCC GTT GGGGGC GTC GCA AGG GCT CTC GCA CAC GGT GTG AGG GTC CTT 96 Pro Val Gly GlyVal Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 20 25 30 GAG GAC GGG GTAAAC TAT CCA ACA GGG AAT TTA CCC GGT TGC TCT TTC 144 Glu Asp Gly Val AsnTyr Pro Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCT ATC TTT ATT CTTGCT CTT CTC TCG TGT CTG ACC GTT CCG GCC TCT 192 Ser Ile Phe Ile Leu AlaLeu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60 GCA GTT CCC TAC CGA AATGCC TCT GGG ATT TAT CAT GTT ACC AAT GAT 240 Ala Val Pro Tyr Arg Asn AlaSer Gly Ile Tyr His Val Thr Asn Asp 65 70 75 80 TGC CCA AAC TCT TCC ATAGTC TAT GAG GCA GAT AAC CTG ATC CTA CAC 288 Cys Pro Asn Ser Ser Ile ValTyr Glu Ala Asp Asn Leu Ile Leu His 85 90 95 GCA CCT GGT TGC GTG CCT TGTGTC ATG ACA GGT AAT GTG AGT AGA TGC 336 Ala Pro Gly Cys Val Pro Cys ValMet Thr Gly Asn Val Ser Arg Cys 100 105 110 TGG GTC CAA ATT ACC CCT ACACTG TCA GCC CCG AGC CTC GGA GCA GTC 384 Trp Val Gln Ile Thr Pro Thr LeuSer Ala Pro Ser Leu Gly Ala Val 115 120 125 ACG GCT CCT CTT CGG AGA GCCGTT GAC TAC CTA GCG GGA GGG GCT GCC 432 Thr Ala Pro Leu Arg Arg Ala ValAsp Tyr Leu Ala Gly Gly Ala Ala 130 135 140 CTC TGC TCC GCG TTA TAC GTAGGA GAC GCG TGT GGG GCA CTA TTC TTG 480 Leu Cys Ser Ala Leu Tyr Val GlyAsp Ala Cys Gly Ala Leu Phe Leu 145 150 155 160 GTA GGC CAA ATG TTC ACCTAT AGG CCT CGC CAG CAC GCT ACG GTG CAG 528 Val Gly Gln Met Phe Thr TyrArg Pro Arg Gln His Ala Thr Val Gln 165 170 175 AAC TGC AAC TGT TCC ATTTAC AGT GGC CAT GTT ACC GGC CAC CGG ATG 576 Asn Cys Asn Cys Ser Ile TyrSer Gly His Val Thr Gly His Arg Met 180 185 190 GCG 579 Ala (2)INFORMATION FOR SEQ ID NO: 154: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: ThrCys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Gly 1 5 10 15Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 20 25 30Glu Asp Gly Val Asn Tyr Pro Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45Ser Ile Phe Ile Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60Ala Val Pro Tyr Arg Asn Ala Ser Gly Ile Tyr His Val Thr Asn Asp 65 70 7580 Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Asp Asn Leu Ile Leu His 85 9095 Ala Pro Gly Cys Val Pro Cys Val Met Thr Gly Asn Val Ser Arg Cys 100105 110 Trp Val Gln Ile Thr Pro Thr Leu Ser Ala Pro Ser Leu Gly Ala Val115 120 125 Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Ala Gly Gly AlaAla 130 135 140 Leu Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly Ala LeuPhe Leu 145 150 155 160 Val Gly Gln Met Phe Thr Tyr Arg Pro Arg Gln HisAla Thr Val Gln 165 170 175 Asn Cys Asn Cys Ser Ile Tyr Ser Gly His ValThr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO: 155:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:155: ACG TGC GGA TTC GCC GAC CTC GTG GGG TAC ATC CCG CTC GTA GGC GGC 48Thr Cys Gly Phe Ala Asp Leu Val Gly Tyr Ile Pro Leu Val Gly Gly 1 5 1015 CCC GTT GGG GGC GTC GCA AGG GCT CTC GCA CAT GGT GTG AGG GTT CTT 96Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 20 25 30GAG GAC GGG GTG AAT TAT GCA ACA GGG AAT CTG CCT GGT TGC TCT TTC 144 GluAsp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TTC ATT CTT GCA CTT CTC TCG TGC CTC ACT GTC CCG GCC TCT 192 Ser IlePhe Ile Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60 GCA GTTCCC TAC CGA AAT GCC TCT GGG ATC TAT CAT GTC ACC AAT GAT 240 Ala Val ProTyr Arg Asn Ala Ser Gly Ile Tyr His Val Thr Asn Asp 65 70 75 80 TGC CCAAAC TCT TCC ATA GTC TAT GAG GCA GAT GAT CTG ATC CTA CAC 288 Cys Pro AsnSer Ser Ile Val Tyr Glu Ala Asp Asp Leu Ile Leu His 85 90 95 GCA CCT GGCTGC GTG CCT TGT GTC AGG AAA GAT AAT GTG AGT AGG TGC 336 Ala Pro Gly CysVal Pro Cys Val Arg Lys Asp Asn Val Ser Arg Cys 100 105 110 TGG GTC CAAATT ACC CCC ACG CTG TCA GCC CCG AGC TTC GGA GCA GTC 384 Trp Val Gln IleThr Pro Thr Leu Ser Ala Pro Ser Phe Gly Ala Val 115 120 125 ACG GCT CCCCTT CGG AGA GCC GTT GAT TAC TTG GTG GGA GGG GCT GCC 432 Thr Ala Pro LeuArg Arg Ala Val Asp Tyr Leu Val Gly Gly Ala Ala 130 135 140 CTC TGC TCCGCG TTA TAC GTT GGA GAC GCG TGT GGG GCA CTA TTT TTG 480 Leu Cys Ser AlaLeu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu 145 150 155 160 GTA GGCCAA ATG TTC ACC TAT AGG CCT CGC CAG CAT GCT ACG GTG CAG 528 Val Gly GlnMet Phe Thr Tyr Arg Pro Arg Gln His Ala Thr Val Gln 165 170 175 GAC TGCAAC TGT TCC ATC TAC AGT GGC CAC GTC ACC GGC CAT CAG ATG 576 Asp Cys AsnCys Ser Ile Tyr Ser Gly His Val Thr Gly His Gln Met 180 185 190 GCA 579Ala (2) INFORMATION FOR SEQ ID NO: 156: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:Thr Cys Gly Phe Ala Asp Leu Val Gly Tyr Ile Pro Leu Val Gly Gly 1 5 1015 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu 20 2530 Glu Asp Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 4045 Ser Ile Phe Ile Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 5560 Ala Val Pro Tyr Arg Asn Ala Ser Gly Ile Tyr His Val Thr Asn Asp 65 7075 80 Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Asp Asp Leu Ile Leu His 8590 95 Ala Pro Gly Cys Val Pro Cys Val Arg Lys Asp Asn Val Ser Arg Cys100 105 110 Trp Val Gln Ile Thr Pro Thr Leu Ser Ala Pro Ser Phe Gly AlaVal 115 120 125 Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Val Gly GlyAla Ala 130 135 140 Leu Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly AlaLeu Phe Leu 145 150 155 160 Val Gly Gln Met Phe Thr Tyr Arg Pro Arg GlnHis Ala Thr Val Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Ser Gly HisVal Thr Gly His Gln Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO:157: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 530 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 3..530 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 3..527 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:157: CA CCT ACG ACA GCT CTG CTG GTG GCC CAG TTA CTG CGG ATT CCC CAA 47Pro Thr Thr Ala Leu Leu Val Ala Gln Leu Leu Arg Ile Pro Gln 1 5 10 15GTG GTC ATT GAC ATC ATC GCA GGG AGC CAC TGG GGG GTC TTG TTT GCC 95 ValVal Ile Asp Ile Ile Ala Gly Ser His Trp Gly Val Leu Phe Ala 20 25 30 GCCGCA TAC TAT GCA TCG GTG GCT AAC TGG ACC AAG GTC GTG CTG GTC 143 Ala AlaTyr Tyr Ala Ser Val Ala Asn Trp Thr Lys Val Val Leu Val 35 40 45 TTG TTTCTG TTT GCA GGG GTT GAT GCT ACT ACC CAG ATT TCG GGC GGC 191 Leu Phe LeuPhe Ala Gly Val Asp Ala Thr Thr Gln Ile Ser Gly Gly 50 55 60 TCC AGC GCCCAA ACG ACG TAT GGC ATC GCC TCA TTT ATC ACC CGC GGC 239 Ser Ser Ala GlnThr Thr Tyr Gly Ile Ala Ser Phe Ile Thr Arg Gly 65 70 75 GCG CAG CAG AAACTG CAG CTC ATA AAT ACC AAC GGA AGC TGG CAC ATC 287 Ala Gln Gln Lys LeuGln Leu Ile Asn Thr Asn Gly Ser Trp His Ile 80 85 90 95 AAC AGG ACC GCCCTT AAT TGT AAT GAC AGC CTC CAG ACT GGG TTC ATA 335 Asn Arg Thr Ala LeuAsn Cys Asn Asp Ser Leu Gln Thr Gly Phe Ile 100 105 110 GCC GGC CTC TTCTAC TAC CAT AAG TTC AAC TCT TCT GGA TGC CCG GAT 383 Ala Gly Leu Phe TyrTyr His Lys Phe Asn Ser Ser Gly Cys Pro Asp 115 120 125 CGG ATG GCT AGCTGT AGG GCC CTT GCC ACT TTT GAC CAG GGC TGG GGA 431 Arg Met Ala Ser CysArg Ala Leu Ala Thr Phe Asp Gln Gly Trp Gly 130 135 140 ACT ATC AGC TATGCC AAC ATA TCG GGT CCC AGT GAT GAC AAA CCA TAT 479 Thr Ile Ser Tyr AlaAsn Ile Ser Gly Pro Ser Asp Asp Lys Pro Tyr 145 150 155 TGC TGG CAC TATCCC CCA CGG CCG TGC GGA GTG GTG CCA GCC CAA GAG 527 Cys Trp His Tyr ProPro Arg Pro Cys Gly Val Val Pro Ala Gln Glu 160 165 170 175 GTC 530 Val(2) INFORMATION FOR SEQ ID NO: 158: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 176 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: ProThr Thr Ala Leu Leu Val Ala Gln Leu Leu Arg Ile Pro Gln Val 1 5 10 15Val Ile Asp Ile Ile Ala Gly Ser His Trp Gly Val Leu Phe Ala Ala 20 25 30Ala Tyr Tyr Ala Ser Val Ala Asn Trp Thr Lys Val Val Leu Val Leu 35 40 45Phe Leu Phe Ala Gly Val Asp Ala Thr Thr Gln Ile Ser Gly Gly Ser 50 55 60Ser Ala Gln Thr Thr Tyr Gly Ile Ala Ser Phe Ile Thr Arg Gly Ala 65 70 7580 Gln Gln Lys Leu Gln Leu Ile Asn Thr Asn Gly Ser Trp His Ile Asn 85 9095 Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gln Thr Gly Phe Ile Ala 100105 110 Gly Leu Phe Tyr Tyr His Lys Phe Asn Ser Ser Gly Cys Pro Asp Arg115 120 125 Met Ala Ser Cys Arg Ala Leu Ala Thr Phe Asp Gln Gly Trp GlyThr 130 135 140 Ile Ser Tyr Ala Asn Ile Ser Gly Pro Ser Asp Asp Lys ProTyr Cys 145 150 155 160 Trp His Tyr Pro Pro Arg Pro Cys Gly Val Val ProAla Gln Glu Val 165 170 175 (2) INFORMATION FOR SEQ ID NO: 159: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleicacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 2..340 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 2..337 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:159: C TCG ACC GTT ACC GAA CAT GAC ATA ATG ACC GAA GAG TCC ATT TAC 46Ser Thr Val Thr Glu His Asp Ile Met Thr Glu Glu Ser Ile Tyr 1 5 10 15CAA TCA TGT GAC TTG CAG CCC GAG GCA CGC GCA GCA ATA CGG TCA CTC 94 GlnSer Cys Asp Leu Gln Pro Glu Ala Arg Ala Ala Ile Arg Ser Leu 20 25 30 ACCCAA CGC CTC TAC TGT GGA GGC CCC ATG TAC AAC AGC AAG GGG CAA 142 Thr GlnArg Leu Tyr Cys Gly Gly Pro Met Tyr Asn Ser Lys Gly Gln 35 40 45 CAG TGTGGT TAT CGC AGA TGC CGC GCC AGC GGC GTT TTC ACC ACC AGT 190 Gln Cys GlyTyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser 50 55 60 ATG GGC AACACC ATG ACG TGC TAC ATC AAG GCT TTA GCC TCC TGT AGA 238 Met Gly Asn ThrMet Thr Cys Tyr Ile Lys Ala Leu Ala Ser Cys Arg 65 70 75 GCC GCA AGG CTCCGG GAC TGC ACG CTC CTG GTG TGT GGT GAC GAT CTT 286 Ala Ala Arg Leu ArgAsp Cys Thr Leu Leu Val Cys Gly Asp Asp Leu 80 85 90 95 GTG GCC ATC TGCGAG AGC CAG GGG ACA CAC GAG GAT GAA GCA AGC CTG 334 Val Ala Ile Cys GluSer Gln Gly Thr His Glu Asp Glu Ala Ser Leu 100 105 110 AGA GCC 340 ArgAla (2) INFORMATION FOR SEQ ID NO: 160: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160:Ser Thr Val Thr Glu His Asp Ile Met Thr Glu Glu Ser Ile Tyr Gln 1 5 1015 Ser Cys Asp Leu Gln Pro Glu Ala Arg Ala Ala Ile Arg Ser Leu Thr 20 2530 Gln Arg Leu Tyr Cys Gly Gly Pro Met Tyr Asn Ser Lys Gly Gln Gln 35 4045 Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser Met 50 5560 Gly Asn Thr Met Thr Cys Tyr Ile Lys Ala Leu Ala Ser Cys Arg Ala 65 7075 80 Ala Arg Leu Arg Asp Cys Thr Leu Leu Val Cys Gly Asp Asp Leu Val 8590 95 Ala Ile Cys Glu Ser Gln Gly Thr His Glu Asp Glu Ala Ser Leu Arg100 105 110 Ala (2) INFORMATION FOR SEQ ID NO: 161: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 2..340 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 2..337 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: C TCA ACCGCC ACC GAA CAT GAC ATA TTG ACT GAA GAG TCC ATA TAC 46 Ser Thr Ala ThrGlu His Asp Ile Leu Thr Glu Glu Ser Ile Tyr 1 5 10 15 CAA TCA TGT GACTCG CAG CCC GAC GCA CGC GCA GCA ATA CGG TCA CTC 94 Gln Ser Cys Asp SerGln Pro Asp Ala Arg Ala Ala Ile Arg Ser Leu 20 25 30 ACC CAA CGC TTG TTCTGT GGA GGC CCC ATG TAT AAC AGC AAG GGG CAA 142 Thr Gln Arg Leu Phe CysGly Gly Pro Met Tyr Asn Ser Lys Gly Gln 35 40 45 CAA TGT GGT TAT CGC AGATGC CGC GCC AGC GGC GTC TTC ACC ACC AGT 190 Gln Cys Gly Tyr Arg Arg CysArg Ala Ser Gly Val Phe Thr Thr Ser 50 55 60 ATG GGC AAC ACC ATG ACG TGCTAC ATT AAG GCT TTA GCC TCC TGT AGA 238 Met Gly Asn Thr Met Thr Cys TyrIle Lys Ala Leu Ala Ser Cys Arg 65 70 75 ACC GCT GGG CTC CGG GAC TAC ACGCTC CTG GTG TGT GGT GAC GAT CAT 286 Thr Ala Gly Leu Arg Asp Tyr Thr LeuLeu Val Cys Gly Asp Asp His 80 85 90 95 GTG GCC ATC TGC GAG AGC CAG GGGACA CAC GAG GAT GAA GCG AAC CTG 334 Val Ala Ile Cys Glu Ser Gln Gly ThrHis Glu Asp Glu Ala Asn Leu 100 105 110 AGA GCC 340 Arg Ala (2)INFORMATION FOR SEQ ID NO: 162: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: SerThr Ala Thr Glu His Asp Ile Leu Thr Glu Glu Ser Ile Tyr Gln 1 5 10 15Ser Cys Asp Ser Gln Pro Asp Ala Arg Ala Ala Ile Arg Ser Leu Thr 20 25 30Gln Arg Leu Phe Cys Gly Gly Pro Met Tyr Asn Ser Lys Gly Gln Gln 35 40 45Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser Met 50 55 60Gly Asn Thr Met Thr Cys Tyr Ile Lys Ala Leu Ala Ser Cys Arg Thr 65 70 7580 Ala Gly Leu Arg Asp Tyr Thr Leu Leu Val Cys Gly Asp Asp His Val 85 9095 Ala Ile Cys Glu Ser Gln Gly Thr His Glu Asp Glu Ala Asn Leu Arg 100105 110 Ala (2) INFORMATION FOR SEQ ID NO: 163: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 499 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 1..499 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 1..496 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: ATG AGC ACGAAT CCT AAA CTT CAA AGA AAA ACC AAA CGT AAC ACC AAC 48 Met Ser Thr AsnPro Lys Leu Gln Arg Lys Thr Lys Arg Asn Thr Asn 1 5 10 15 CGC CGC CCCATG GAC GTT AAG TTC CCG GGT GGT GGC CAG ATC GTT GGC 96 Arg Arg Pro MetAsp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30 GGA GTT TAC TTGTTG CCG CGC AGG GGC CCT AGG TTG GGT GTG CGC GCG 144 Gly Val Tyr Leu LeuPro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 35 40 45 ACT CGG AAG ACT TCGGAG CGG TCG CAA CCT CGT GGG AGG CGC CAA CCT 192 Thr Arg Lys Thr Ser GluArg Ser Gln Pro Arg Gly Arg Arg Gln Pro 50 55 60 ATC CCC AAG GCG CGC CGATCC GAG GGC AGA TCC TGG GCG CAG CCC GGG 240 Ile Pro Lys Ala Arg Arg SerGlu Gly Arg Ser Trp Ala Gln Pro Gly 65 70 75 80 TAT CCT TGG CCC CTT TACGGC AAT GAG GGC TGT GGG TGG GCA GGG TGG 288 Tyr Pro Trp Pro Leu Tyr GlyAsn Glu Gly Cys Gly Trp Ala Gly Trp 85 90 95 CTC CTG TCC CCT CGC GGG TCTCGG CCG TCT TGG GGC CCT AAT GAT CCC 336 Leu Leu Ser Pro Arg Gly Ser ArgPro Ser Trp Gly Pro Asn Asp Pro 100 105 110 CGG CGG AGG TCC CGC AAC CTGGGT AAG GTC ATC GAT ACC CTA ACA TGC 384 Arg Arg Arg Ser Arg Asn Leu GlyLys Val Ile Asp Thr Leu Thr Cys 115 120 125 GGC TTC GCC GAC CTC ATG GGATAC ATC CCG CTT GTA GGC GCC CCC GTG 432 Gly Phe Ala Asp Leu Met Gly TyrIle Pro Leu Val Gly Ala Pro Val 130 135 140 GGT GGC GTC GCC AGA GCC CTGGCA CAC GGT GTT AGG GCT GTG GAA GAC 480 Gly Gly Val Ala Arg Ala Leu AlaHis Gly Val Arg Ala Val Glu Asp 145 150 155 160 GGG ATC AAC TAC GCA ACAG 499 Gly Ile Asn Tyr Ala Thr 165 (2) INFORMATION FOR SEQ ID NO: 164:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 166 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 164: Met Ser Thr Asn Pro Lys Leu GlnArg Lys Thr Lys Arg Asn Thr Asn 1 5 10 15 Arg Arg Pro Met Asp Val LysPhe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30 Gly Val Tyr Leu Leu Pro ArgArg Gly Pro Arg Leu Gly Val Arg Ala 35 40 45 Thr Arg Lys Thr Ser Glu ArgSer Gln Pro Arg Gly Arg Arg Gln Pro 50 55 60 Ile Pro Lys Ala Arg Arg SerGlu Gly Arg Ser Trp Ala Gln Pro Gly 65 70 75 80 Tyr Pro Trp Pro Leu TyrGly Asn Glu Gly Cys Gly Trp Ala Gly Trp 85 90 95 Leu Leu Ser Pro Arg GlySer Arg Pro Ser Trp Gly Pro Asn Asp Pro 100 105 110 Arg Arg Arg Ser ArgAsn Leu Gly Lys Val Ile Asp Thr Leu Thr Cys 115 120 125 Gly Phe Ala AspLeu Met Gly Tyr Ile Pro Leu Val Gly Ala Pro Val 130 135 140 Gly Gly ValAla Arg Ala Leu Ala His Gly Val Arg Ala Val Glu Asp 145 150 155 160 GlyIle Asn Tyr Ala Thr 165 (2) INFORMATION FOR SEQ ID NO: 165: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 499 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: DNA(genomic) (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 165: ATGAGCACGA ATCCTAAACC TCAAAGAAAA ACCAAACGTAACACCAACCG CCGCCCTATG 60 GACGTTAAGT TCCCAGGCGG TGGTCAGATC GTTGGCGGAGTTTACTTGTT GCCGCGCAGG 120 GGCCCCAGGT TGGGTGTGCG CGCGACTCGG AAGACTTCGGAGCGGTCGCA ACCTCGTGGG 180 AGGCGCCAAC CTATCCCCAA GGCGCGCCGA ACCGAGGGCAGATCCTGGGC GCAGCCCGGG 240 TATCCTTGGC CCCTTTACGG CAATGAGGGC TGTGGGTGGGCAGGGTGGCT CCTGTCCCCT 300 CGCGGNTCTC GGNCGTCTTG GGGCCCCAAT GATCCCCGGNGGAGATCCCG CAACTTGGGT 360 AAGGTCATCG ATACCCTAAC ATGCGGCTTC GCCGACCTCATGGGATACAT CCCGCTTGTA 420 GGCGCCCCCG TGGGTGGCGT CGCCAGGGCC CTGGCACATGGTGTTAGGGC TGTGGAAGAC 480 GGGATCAATT ATGCAACAG 499 (2) INFORMATION FORSEQ ID NO: 166: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 126 aminoacids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 1 5 1015 Arg Arg Pro Met Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 20 2530 Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 35 4045 Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 50 5560 Ile Pro Lys Ala Arg Arg Thr Glu Gly Arg Ser Trp Ala Gln Pro Gly 65 7075 80 Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 8590 95 Leu Leu Ser Pro Arg Xaa Ser Arg Xaa Ser Trp Gly Pro Asn Asp Pro100 105 110 Arg Xaa Arg Ser Arg Asn Leu Gly Lys Val Ile Asp Thr Leu 115120 125 (2) INFORMATION FOR SEQ ID NO: 167: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 1..579 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: ACA TGC GGCTTC GCC GAC CTC ATG GGA TAC ATC CCG CTT GTA GGC GCC 48 Thr Cys Gly PheAla Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 10 15 CCC GTG GGTGGC GTC GCC AGG GCC CTG GCA CAT GGT GTT AGG GCT GTG 96 Pro Val Gly GlyVal Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30 GAA GAC GGG ATCAAT TAT GCA ACA GGG AAC CTT CCC GGT TGC TCC TTT 144 Glu Asp Gly Ile AsnTyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCT ATC TTC CTC TTGGCG CTC CTC TCG TGC CTG ACT GTT CCC ACA TCG 192 Ser Ile Phe Leu Leu AlaLeu Leu Ser Cys Leu Thr Val Pro Thr Ser 50 55 60 GCC GTT AAC TAT CGC AATGCT TCG GGC ATT TAT CAC ATC ACC AAT GAC 240 Ala Val Asn Tyr Arg Asn AlaSer Gly Ile Tyr His Ile Thr Asn Asp 65 70 75 80 TGC CCG AAT GCA AGC ATAGTG TAC GAG ACC GAA AAT CAC ATC TTA CAC 288 Cys Pro Asn Ala Ser Ile ValTyr Glu Thr Glu Asn His Ile Leu His 85 90 95 CTC CCA GGG TGC GTA CCC TGTGTG AGG ACT GGG AAC CAG TCG CGG TGT 336 Leu Pro Gly Cys Val Pro Cys ValArg Thr Gly Asn Gln Ser Arg Cys 100 105 110 TGG GTG GCC CTC ACT CCC ACAGTA GCG TCG CCA TAC GCC GGT GCT CCG 384 Trp Val Ala Leu Thr Pro Thr ValAla Ser Pro Tyr Ala Gly Ala Pro 115 120 125 CTT GAG CCC TTG CGG CGT CATGTG GAC CTG ATG GTA GGT GCT GCC ACC 432 Leu Glu Pro Leu Arg Arg His ValAsp Leu Met Val Gly Ala Ala Thr 130 135 140 ATG TGT TCC GCC CTC TAC ATCGGC GAC TTG TGC GGT GGC TTA TTC TTG 480 Met Cys Ser Ala Leu Tyr Ile GlyAsp Leu Cys Gly Gly Leu Phe Leu 145 150 155 160 GTG GGC CAA ATG TTC ACCTTC CAA CCG CGA CGT CAC TGG ACC ACT CAG 528 Val Gly Gln Met Phe Thr PheGln Pro Arg Arg His Trp Thr Thr Gln 165 170 175 GAC TGC AAT TGT TCC ATCTAC ACG GGC CAC ATT ACG GGT CAT CGG ATG 576 Asp Cys Asn Cys Ser Ile TyrThr Gly His Ile Thr Gly His Arg Met 180 185 190 GCA 579 Ala (2)INFORMATION FOR SEQ ID NO: 168: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: ThrCys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 10 15Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 50 55 60Ala Val Asn Tyr Arg Asn Ala Ser Gly Ile Tyr His Ile Thr Asn Asp 65 70 7580 Cys Pro Asn Ala Ser Ile Val Tyr Glu Thr Glu Asn His Ile Leu His 85 9095 Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Gln Ser Arg Cys 100105 110 Trp Val Ala Leu Thr Pro Thr Val Ala Ser Pro Tyr Ala Gly Ala Pro115 120 125 Leu Glu Pro Leu Arg Arg His Val Asp Leu Met Val Gly Ala AlaThr 130 135 140 Met Cys Ser Ala Leu Tyr Ile Gly Asp Leu Cys Gly Gly LeuPhe Leu 145 150 155 160 Val Gly Gln Met Phe Thr Phe Gln Pro Arg Arg HisTrp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Thr Gly His IleThr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO: 169:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:169: ACA TGC GGC TTC GCC GAC CTC ATG GGA TAC ATC CCG CTT GTA GGC GCC 48Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 CCC GTG GGT GGC GTC GCC AGA GCC CTG GCA CAC GGT GTT AGG GCT GTG 96Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30GAA GAC GGG ATC AAC TAC GCA ACA GGG AAT CTC CCC GGT TGC TCC TTT 144 GluAsp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TTC CTC TTG GCA CTT CTC TCG TGC CTC ACT GTT CCC GCG TCG 192 Ser IlePhe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60 GGC GTTAAC TAT CGC AAT GCT TCG GGC GTT TAT CAC ATC ACC AAC GAC 240 Gly Val AsnTyr Arg Asn Ala Ser Gly Val Tyr His Ile Thr Asn Asp 65 70 75 80 TGC CCGAAT GCG AGC ATA GTG TAC GAG ACC GAC AAT CAC ATC TTA CAC 288 Cys Pro AsnAla Ser Ile Val Tyr Glu Thr Asp Asn His Ile Leu His 85 90 95 CTC CCA GGGTGC GTA CCC TGT GTG AAG ACC GGG AAC CAG TCG CGG TGT 336 Leu Pro Gly CysVal Pro Cys Val Lys Thr Gly Asn Gln Ser Arg Cys 100 105 110 TGG GTG GCCCTC ACT CCC ACA GTG GCG TCG CCT TAC GTC GGT GCT CCG 384 Trp Val Ala LeuThr Pro Thr Val Ala Ser Pro Tyr Val Gly Ala Pro 115 120 125 CTC GAG CCCTTG CGG CGC CAT GTG GAC CTG ATG GTA GGT GCT GCC ACC 432 Leu Glu Pro LeuArg Arg His Val Asp Leu Met Val Gly Ala Ala Thr 130 135 140 GTG TGC TCCGCC CTC TAC GTC GGC GAC CTG TGC GGT GGC TTA TTC TTG 480 Val Cys Ser AlaLeu Tyr Val Gly Asp Leu Cys Gly Gly Leu Phe Leu 145 150 155 160 GTA GGCCAA ATG TTC ACC TTC CAA CCG CGA CGC CAC TGG ACG ACC CAG 528 Val Gly GlnMet Phe Thr Phe Gln Pro Arg Arg His Trp Thr Thr Gln 165 170 175 GAC TGTAAT TGT TCC ATC TAC GCA GGG CAT ATT ACG GGC CAT CGG ATG 576 Asp Cys AsnCys Ser Ile Tyr Ala Gly His Ile Thr Gly His Arg Met 180 185 190 GCT 579Ala (2) INFORMATION FOR SEQ ID NO: 170: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 2530 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 4045 Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 5560 Gly Val Asn Tyr Arg Asn Ala Ser Gly Val Tyr His Ile Thr Asn Asp 65 7075 80 Cys Pro Asn Ala Ser Ile Val Tyr Glu Thr Asp Asn His Ile Leu His 8590 95 Leu Pro Gly Cys Val Pro Cys Val Lys Thr Gly Asn Gln Ser Arg Cys100 105 110 Trp Val Ala Leu Thr Pro Thr Val Ala Ser Pro Tyr Val Gly AlaPro 115 120 125 Leu Glu Pro Leu Arg Arg His Val Asp Leu Met Val Gly AlaAla Thr 130 135 140 Val Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly GlyLeu Phe Leu 145 150 155 160 Val Gly Gln Met Phe Thr Phe Gln Pro Arg ArgHis Trp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Ala Gly HisIle Thr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO:171: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:171: ACA TGC GGC TTC GCC GAC CTC ATG GGA TAC ATC CCG CTT GTG GGC GCC 48Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 CCT GTT GGT GGC GTC GCC AGA GCC CTT GCG CAC GGC GTC AGG GCT GTG 96Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30GAA GAC GGG ATT AAC TAT GCA ACA GGG AAC CTT CCT GGT TGC TCC TTT 144 GluAsp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TTC CTT CTG GCA CTT CTC TCG TGC CTG ACT GTC CCC GCC TCG 192 Ser IlePhe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60 GCT GTGCAT TAT CAC AAC ACC TCG GGC ATC TAC CAC CTC ACC AAT GAC 240 Ala Val HisTyr His Asn Thr Ser Gly Ile Tyr His Leu Thr Asn Asp 65 70 75 80 TGC CCTAAC TCT AGC ATA GTC TTT GAG GCA GTC CAT CAC ATC TTG CAC 288 Cys Pro AsnSer Ser Ile Val Phe Glu Ala Val His His Ile Leu His 85 90 95 CTT CCA GGATGC GTC CCT TGT GTA AGA ACT GGG AAC CAG TCT CGG TGC 336 Leu Pro Gly CysVal Pro Cys Val Arg Thr Gly Asn Gln Ser Arg Cys 100 105 110 TGG GTA GCCTTG ACC CCC ACG CTG GCC GCG CCA TAC CTT GGC GCT CCA 384 Trp Val Ala LeuThr Pro Thr Leu Ala Ala Pro Tyr Leu Gly Ala Pro 115 120 125 CTC GAG TCCATG CGG CGT CAC GTG GAT TTG ATG GTG GGC ACT GCT ACA 432 Leu Glu Ser MetArg Arg His Val Asp Leu Met Val Gly Thr Ala Thr 130 135 140 TTG TGC TCAGCA CTC TAC GTT GGG GAC CTG TGC GGG GGC ATA TTC CTA 480 Leu Cys Ser AlaLeu Tyr Val Gly Asp Leu Cys Gly Gly Ile Phe Leu 145 150 155 160 GCG GGCCAG ATG TTC ACC TTC CGG CCC CGC CTC CAT TGG ACC ACC CAG 528 Ala Gly GlnMet Phe Thr Phe Arg Pro Arg Leu His Trp Thr Thr Gln 165 170 175 GAG TGCAAT TGT TCC ACC TAT CCG GGC CAC ATC ACG GGT CAT AGA ATG 576 Glu Cys AsnCys Ser Thr Tyr Pro Gly His Ile Thr Gly His Arg Met 180 185 190 GCG 579Ala (2) INFORMATION FOR SEQ ID NO: 172: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 2530 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 4045 Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 5560 Ala Val His Tyr His Asn Thr Ser Gly Ile Tyr His Leu Thr Asn Asp 65 7075 80 Cys Pro Asn Ser Ser Ile Val Phe Glu Ala Val His His Ile Leu His 8590 95 Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Gln Ser Arg Cys100 105 110 Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Pro Tyr Leu Gly AlaPro 115 120 125 Leu Glu Ser Met Arg Arg His Val Asp Leu Met Val Gly ThrAla Thr 130 135 140 Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly GlyIle Phe Leu 145 150 155 160 Ala Gly Gln Met Phe Thr Phe Arg Pro Arg LeuHis Trp Thr Thr Gln 165 170 175 Glu Cys Asn Cys Ser Thr Tyr Pro Gly HisIle Thr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO:173: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:173: ACG TGC GGT TCC GCC GAC CTC ATG GGA TAC ATC CCG CTC GTA GGC GCC 48Thr Cys Gly Ser Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 CCT GTG GGT GGC GTC GCC AGG GCC TTG GCG CAT GGC GTC AGG GCT GTG 96Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30GAG GAC GGG ATA AAC TAT GCA ACA GGG AAC CTT CCT GGT TGC TCT TTT 144 GluAsp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TTC CTT CTG GCA CTT CTC TCG TGC CTG ACT GTC CCC GCC TCA 192 Ser IlePhe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60 GCT GTGCAT TAT CAC AAC ACC TCG GGC ATC TAT CAC ATC ACT AAT GAC 240 Ala Val HisTyr His Asn Thr Ser Gly Ile Tyr His Ile Thr Asn Asp 65 70 75 80 TGC CCTAAC TCT AGC ATA GTC TTT GAG GCA GAG CAT CAC ATC TTG CAT 288 Cys Pro AsnSer Ser Ile Val Phe Glu Ala Glu His His Ile Leu His 85 90 95 CTT CCA GGATGC GTC CCC TGT GTG AGA ACT GGG AAC CAG TCA CGA TGC 336 Leu Pro Gly CysVal Pro Cys Val Arg Thr Gly Asn Gln Ser Arg Cys 100 105 110 TGG ATA GCCTTG ACC CCT ACG TTG GCC GCG CCA CAC ATT GGC GCT CCA 384 Trp Ile Ala LeuThr Pro Thr Leu Ala Ala Pro His Ile Gly Ala Pro 115 120 125 CTT GAG TCCATG CGA CGT CAT GTG GAT TTG ATG GTA GGC ACT GCC ACA 432 Leu Glu Ser MetArg Arg His Val Asp Leu Met Val Gly Thr Ala Thr 130 135 140 TTG TGC TCCGCA CTC TAC ATT GGA GAT CTG TGC GGA GGC ATA TTT CTA 480 Leu Cys Ser AlaLeu Tyr Ile Gly Asp Leu Cys Gly Gly Ile Phe Leu 145 150 155 160 GTG GGCCAG ATG TTC AAC TTC AGG CCC CGC CTG CAC TGG ACC ACC CAG 528 Val Gly GlnMet Phe Asn Phe Arg Pro Arg Leu His Trp Thr Thr Gln 165 170 175 GAG TGCAAT TGT TCC ATC TAT CCA GGC CAC ATC ACG GGT CAC AGA ATG 576 Glu Cys AsnCys Ser Ile Tyr Pro Gly His Ile Thr Gly His Arg Met 180 185 190 GCG 579Ala (2) INFORMATION FOR SEQ ID NO: 174: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:Thr Cys Gly Ser Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 2530 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 4045 Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 5560 Ala Val His Tyr His Asn Thr Ser Gly Ile Tyr His Ile Thr Asn Asp 65 7075 80 Cys Pro Asn Ser Ser Ile Val Phe Glu Ala Glu His His Ile Leu His 8590 95 Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Gln Ser Arg Cys100 105 110 Trp Ile Ala Leu Thr Pro Thr Leu Ala Ala Pro His Ile Gly AlaPro 115 120 125 Leu Glu Ser Met Arg Arg His Val Asp Leu Met Val Gly ThrAla Thr 130 135 140 Leu Cys Ser Ala Leu Tyr Ile Gly Asp Leu Cys Gly GlyIle Phe Leu 145 150 155 160 Val Gly Gln Met Phe Asn Phe Arg Pro Arg LeuHis Trp Thr Thr Gln 165 170 175 Glu Cys Asn Cys Ser Ile Tyr Pro Gly HisIle Thr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO:175: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:175: ACG TGC GGC TTT GCC GAC CTC ATG GGA TAC ATC CCG CTC GTG GGC GCC 48Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 CCT GTG GGT GGC GTC GCC AGG GCC TTG GCA CAT GGT GTC AGG GCC GTG 96Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30GAG GAC GGG ATT AAC TAT GCA ACA GGG AAT CTT CCC GGT TGC TCC TTT 144 GluAsp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TTC CTT CTA GCA CTT CTC TCG TGC TTG ACT GTC CCG GCC TCG 192 Ser IlePhe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60 GCG CAGCAC TAC CGG AAC ATC TCG GGC ATT TAT CAC GTC ACC AAT GAC 240 Ala Gln HisTyr Arg Asn Ile Ser Gly Ile Tyr His Val Thr Asn Asp 65 70 75 80 TGC CCGAAC TCT AGT ATA GTG TAT GAA GCT GAC CAT CAT ATC ATG CAT 288 Cys Pro AsnSer Ser Ile Val Tyr Glu Ala Asp His His Ile Met His 85 90 95 CTA CCA GGGTGT GTG CCT TGC GTG AGA ACC GGG AAC ACC TCG CGC TGC 336 Leu Pro Gly CysVal Pro Cys Val Arg Thr Gly Asn Thr Ser Arg Cys 100 105 110 TGG GTT CCTTTA ACA CCC ACT GTG GCT GCC CCC TAT GTT GGC GCG CCG 384 Trp Val Pro LeuThr Pro Thr Val Ala Ala Pro Tyr Val Gly Ala Pro 115 120 125 CTC GAA TCCATG CGG CGG CAC GTG GAC TTA ATG GTG GGT GCC GCC ACC 432 Leu Glu Ser MetArg Arg His Val Asp Leu Met Val Gly Ala Ala Thr 130 135 140 GTC TGC TCGGCC CTG TAC ATC GGA GAC CTT TGC GGA GGT GTC TTC CTG 480 Val Cys Ser AlaLeu Tyr Ile Gly Asp Leu Cys Gly Gly Val Phe Leu 145 150 155 160 GTC GGGCAG ATG TTC ACC TTC CGG CCG CGC CGC CAT TGG ACT ACC CAG 528 Val Gly GlnMet Phe Thr Phe Arg Pro Arg Arg His Trp Thr Thr Gln 165 170 175 GAC TGCAAC TGC TCT ATC TAT GAT GGC CAC ATC ACC GGC CAT AGA ATG 576 Asp Cys AsnCys Ser Ile Tyr Asp Gly His Ile Thr Gly His Arg Met 180 185 190 GCT 579Ala (2) INFORMATION FOR SEQ ID NO: 176: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 2530 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 4045 Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 5560 Ala Gln His Tyr Arg Asn Ile Ser Gly Ile Tyr His Val Thr Asn Asp 65 7075 80 Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Asp His His Ile Met His 8590 95 Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Thr Ser Arg Cys100 105 110 Trp Val Pro Leu Thr Pro Thr Val Ala Ala Pro Tyr Val Gly AlaPro 115 120 125 Leu Glu Ser Met Arg Arg His Val Asp Leu Met Val Gly AlaAla Thr 130 135 140 Val Cys Ser Ala Leu Tyr Ile Gly Asp Leu Cys Gly GlyVal Phe Leu 145 150 155 160 Val Gly Gln Met Phe Thr Phe Arg Pro Arg ArgHis Trp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Asp Gly HisIle Thr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO:177: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:177: ACG TGC GGG TTC GCC GAC CTC ATG GGA TAC ATC CCG CTC GTG GGC GCT 48Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 CCA GTA GGA GGC GTC GCC AGA GCC TTG GCG CAT GGC GTC AGG GCT GTG 96Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30GAG GAC GGG ATC AAT TAC GCA ACA GGG AAC CTT CCC GGC TGC TCC TTT 144 GluAsp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TTC CTC TTG GTA CTT CTC TCG CGC CTA ACT GTC CCA GCG TCT 192 Ser IlePhe Leu Leu Val Leu Leu Ser Arg Leu Thr Val Pro Ala Ser 50 55 60 GCT CAGCAC TAC CGG AAT GCA TCG GGC ATC TAC CAT GTC ACC AAC GAC 240 Ala Gln HisTyr Arg Asn Ala Ser Gly Ile Tyr His Val Thr Asn Asp 65 70 75 80 TGC CCGAAC TCC AGT ATT GTG TAT GAA GCC GAC CAT CAC ATC ATG CAC 288 Cys Pro AsnSer Ser Ile Val Tyr Glu Ala Asp His His Ile Met His 85 90 95 CTA CCC GGGTGT GTG CCC TGT GTA AGA ACT GGG AAT GTC TCG CGT TGC 336 Leu Pro Gly CysVal Pro Cys Val Arg Thr Gly Asn Val Ser Arg Cys 100 105 110 TGG ATT CCTTTA ACA CCC ACT GTA GCC GTC CCC TAC CTC GGG GCT CCA 384 Trp Ile Pro LeuThr Pro Thr Val Ala Val Pro Tyr Leu Gly Ala Pro 115 120 125 CTT ACG TCTGTA CGG CAG CAT GTG GAC CTG ATG GTG GGG GCG GCC ACC 432 Leu Thr Ser ValArg Gln His Val Asp Leu Met Val Gly Ala Ala Thr 130 135 140 TTA TGC TCTGCC CTC TAC ATC GGA GAC CAT TGC GGA GGT GTC TTC TTG 480 Leu Cys Ser AlaLeu Tyr Ile Gly Asp His Cys Gly Gly Val Phe Leu 145 150 155 160 GCA GGGCAG ATG GTC AGT TTC CAA CCC CGG CGT CAT TGG ACT ACC CAG 528 Ala Gly GlnMet Val Ser Phe Gln Pro Arg Arg His Trp Thr Thr Gln 165 170 175 GAT TGCAAC TGT TCC ATC TAT GTG GGC CAC ATC ACC GGC CAC AGG ATG 576 Asp Cys AsnCys Ser Ile Tyr Val Gly His Ile Thr Gly His Arg Met 180 185 190 GCC 579Ala (2) INFORMATION FOR SEQ ID NO: 178: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 2530 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 4045 Ser Ile Phe Leu Leu Val Leu Leu Ser Arg Leu Thr Val Pro Ala Ser 50 5560 Ala Gln His Tyr Arg Asn Ala Ser Gly Ile Tyr His Val Thr Asn Asp 65 7075 80 Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Asp His His Ile Met His 8590 95 Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Val Ser Arg Cys100 105 110 Trp Ile Pro Leu Thr Pro Thr Val Ala Val Pro Tyr Leu Gly AlaPro 115 120 125 Leu Thr Ser Val Arg Gln His Val Asp Leu Met Val Gly AlaAla Thr 130 135 140 Leu Cys Ser Ala Leu Tyr Ile Gly Asp His Cys Gly GlyVal Phe Leu 145 150 155 160 Ala Gly Gln Met Val Ser Phe Gln Pro Arg ArgHis Trp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Val Gly HisIle Thr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO:179: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:179: ACCTGCGGCT TCGCCGACCT CATGGGATAC ATCCCGCTCG TAGGCGCCCC CGTGGGAGGC60 GTCGCCAGAR CTCTGGCGCA TGGCGTCAGG GCTCTGGAAG ACGGGATCAA TTATGCAACA 120GGGAATCTTC CTGGTTGCTC TTTCTCTATC TCCCTTCTTG AACTTCTCTC GTGCCTGACT 180GTTCCCGCCT CAGCCATCCA CTATCGCAAT GCTTCGGACG GTTATTATAT CACCAATGAT 240TGCCCGAACT CTAGCATAGT GTATGAAGCC GAGAACCACA TCTTGCACCT TCCGGGGTGT 300ATACCCTGTG TGAAGACCGG GAATCAGTCG CGGTGCTGGG TGGCTCTCAC CCCCACGCTG 360GCGGCCCCAC ACCTACGTGC TCCGCTTTCG TCCTTACGGG CGCATGTGGA CCTAATGGTG 420GGGGCCGCCA CGGCATGCTC CGCTTTTTAC ATTGGAGATC TGTGCGGGGG TGTGTTTTTG 480GCGGGCCAAC TGTTCACTAT CCGGCCACGC ATTCATGAAA CCACTCAGGA CTGCAATTGC 540TCCATCTACT CAGGGCACAT CACGGGTNNN NNNNNNNNN 579 (2) INFORMATION FOR SEQID NO: 180: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 193 amino acids(B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: ThrCys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 10 15Pro Val Gly Gly Val Ala Arg Xaa Leu Ala His Gly Val Arg Ala Leu 20 25 30Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45Ser Ile Ser Leu Leu Glu Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60Ala Ile His Tyr Arg Asn Ala Ser Asp Gly Tyr Tyr Ile Thr Asn Asp 65 70 7580 Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Glu Asn His Ile Leu His 85 9095 Leu Pro Gly Cys Ile Pro Cys Val Lys Thr Gly Asn Gln Ser Arg Cys 100105 110 Trp Val Ala Leu Thr Pro Thr Leu Ala Ala Pro His Leu Arg Ala Pro115 120 125 Leu Ser Ser Leu Arg Ala His Val Asp Leu Met Val Gly Ala AlaThr 130 135 140 Ala Cys Ser Ala Phe Tyr Ile Gly Asp Leu Cys Gly Gly ValPhe Leu 145 150 155 160 Ala Gly Gln Leu Phe Thr Ile Arg Pro Arg Ile HisGlu Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Ser Gly His IleThr Gly Xaa Xaa Xaa 180 185 190 Xaa (2) INFORMATION FOR SEQ ID NO: 181:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..578 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:181: GCGTGCGGCT TCGCCGATCT CATGGGATAC ATCCCGCTCG TAGGCGCCCC CGTGGGTGGC60 GTCGCCAGAG CCCTGGCGCA CGGTGTTAGG GCTGTGGAGG ACGGGATTAA CTACGCAACA 120GGGAATCTTC CTGGTTGCTC TTTCTCTATC TNCCTTCTGG CACTTCTCTC GTGCCTGACT 180GTCCCGGCCT CGGCTCAGCA CTACCGGAAT GTCTCGGGCA TCTACCACGT CACCAATGAT 240TGCCCGAATT CCAGCATAGT GTATGAAGCC GATCACCACA TCATGCACTT ACCAGGGTGC 300ATACCCTGCG TGAGGACCGG GAACGTTTCG CGCTGCTGGG TATCTCTGAC ACCTACTGTG 360GCTGCTCCCT ACCTCGGGGC TCCGCTTACG TCGCTACGGC GGCATGTGGA TTTGATGGTG 420GGTGCAGCCA CCCTTTGCTC TGCCCTCTAC GTCGGAGACC TCTGTGGAGG TGTCTTCCTA 480GTGGGACAGA TGTTCACCTT CCAGCCGCGC CGCCACTGGA CCACTCAGGA CTGCAACTGC 540TCCATTTACG TCGGCCACAT CACAGGCCAC AGAATGGCT 579 (2) INFORMATION FOR SEQID NO: 182: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 193 amino acids(B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: AlaCys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 10 15Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45Ser Ile Xaa Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60Ala Gln His Tyr Arg Asn Val Ser Gly Ile Tyr His Val Thr Asn Asp 65 70 7580 Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Asp His His Ile Met His 85 9095 Leu Pro Gly Cys Ile Pro Cys Val Arg Thr Gly Asn Val Ser Arg Cys 100105 110 Trp Val Ser Leu Thr Pro Thr Val Ala Ala Pro Tyr Leu Gly Ala Pro115 120 125 Leu Thr Ser Leu Arg Arg His Val Asp Leu Met Val Gly Ala AlaThr 130 135 140 Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly ValPhe Leu 145 150 155 160 Val Gly Gln Met Phe Thr Phe Gln Pro Arg Arg HisTrp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Val Gly His IleThr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO: 183:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..579 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: ACC TGC GGC TTT GCC GAC CTC ATG GGA TAC ATC CCG CTC GTA GGC GCC 48Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 CCT GTG GGT GGC GTC GCC AGG GCC CTA GAA CAC GGT GTT AGG GCT GTG 96Pro Val Gly Gly Val Ala Arg Ala Leu Glu His Gly Val Arg Ala Val 20 25 30GAG GAC GGT ATT AAT TAT GCA ACA GGG AAT CTC CCC GGT TGC TCT TTT 144 GluAsp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TCC CTC TTG GCA CTT CTT TCG TGC CTG ACT GTT CCC ACC TCA 192 Ser IleSer Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 50 55 60 GCC GTCAAC TAT CGC AAC GCC TCG GGC GTC TAT CAT ATC ACC AAT GAC 240 Ala Val AsnTyr Arg Asn Ala Ser Gly Val Tyr His Ile Thr Asn Asp 65 70 75 80 TGC CCGAAT TCG AGC ATA GTG TAC GAG GCT GAC TAC CAC ATC CTA CAC 288 Cys Pro AsnSer Ser Ile Val Tyr Glu Ala Asp Tyr His Ile Leu His 85 90 95 CTC CCT GGGTGC TTA CCC TGC GTG AGG GTT GGG AAT CAG TCA CGC TGC 336 Leu Pro Gly CysLeu Pro Cys Val Arg Val Gly Asn Gln Ser Arg Cys 100 105 110 TGG GTG GCCCTT ACT CCC ACC GTG GCG GCG CCT TAC GTT GGT GCT CCG 384 Trp Val Ala LeuThr Pro Thr Val Ala Ala Pro Tyr Val Gly Ala Pro 115 120 125 CTA GAA TCCCTC CGG AGT CAT GTG GAT CTG ATG GTA GGT GCT GCT ACT 432 Leu Glu Ser LeuArg Ser His Val Asp Leu Met Val Gly Ala Ala Thr 130 135 140 GTG TGC TCCGCT CTT TAC ATC GGG GAC CTG TGC GGT GGC GTA TTT TTG 480 Val Cys Ser AlaLeu Tyr Ile Gly Asp Leu Cys Gly Gly Val Phe Leu 145 150 155 160 GTT GGTCAG ATG TTT TCT TTC CAG CCG CGA CGC CAC TGG ACC ACG CAG 528 Val Gly GlnMet Phe Ser Phe Gln Pro Arg Arg His Trp Thr Thr Gln 165 170 175 GAC TGCAAT TGT TCT ATC TAC GCG GGG CAC GTT ACG GGC CAC AGG ATG 576 Asp Cys AsnCys Ser Ile Tyr Ala Gly His Val Thr Gly His Arg Met 180 185 190 GCA 579Ala (2) INFORMATION FOR SEQ ID NO: 184: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 2530 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 4045 Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 50 5560 Ala Val Asn Tyr Arg Asn Ala Ser Gly Ile Tyr His Ile Thr Asn Asp 65 7075 80 Cys Pro Asn Ala Ser Ile Val Tyr Glu Thr Glu Asn His Ile Leu His 8590 95 Leu Pro Gly Cys Val Pro Cys Val Arg Thr Gly Asn Gln Ser Arg Cys100 105 110 Trp Val Ala Leu Thr Pro Thr Val Ala Ser Pro Tyr Ala Gly AlaPro 115 120 125 Leu Glu Pro Leu Arg Arg His Val Asp Leu Met Val Gly AlaAla Thr 130 135 140 Met Cys Ser Ala Leu Tyr Ile Gly Asp Leu Cys Gly GlyLeu Phe Leu 145 150 155 160 Val Gly Gln Met Phe Thr Phe Gln Pro Arg ArgHis Trp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Thr Gly HisIle Thr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO:185: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:185: ACT TGC GGC TTT GCC GAC CTC ATG GGA TAC ATC CCG CTC GTA GGC GCC 48Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 CCC GTG GGT GGC GTC GCC AGA GCC CTG GAA CAT GGT GTT AGG GCT GTG 96Pro Val Gly Gly Val Ala Arg Ala Leu Glu His Gly Val Arg Ala Val 20 25 30GAG GAC GGC ATC AAT TAT GCA ACA GGG AAT CTC CCC GGT TGC TCT TTC 144 GluAsp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TAC CTC TTG GCA CTT CTC TCG TGC CTG ACT GTT CCC ACC TCG 192 Ser IleTyr Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 50 55 60 GCC ATCCAC TAT CGC AAT GCC TCG GGC GTC TAC CAC GTC ACC AAT GAC 240 Ala Ile HisTyr Arg Asn Ala Ser Gly Val Tyr His Val Thr Asn Asp 65 70 75 80 TGC CCGAAC TCG AGC ATA GTG TAC GAG GCC GAC CAC CAC ATC CTA CAC 288 Cys Pro AsnSer Ser Ile Val Tyr Glu Ala Asp His His Ile Leu His 85 90 95 CTT CCA GGGTGC TTA CCC TGT GTG AGG GTT GGG AAT CAG TCA CGT TGT 336 Leu Pro Gly CysLeu Pro Cys Val Arg Val Gly Asn Gln Ser Arg Cys 100 105 110 TGG GTG GCCCTC TCT CCC ACC GTG GCG GCG CCT TAC ATC GGT GCT CCA 384 Trp Val Ala LeuSer Pro Thr Val Ala Ala Pro Tyr Ile Gly Ala Pro 115 120 125 GTT GAA TCCTTC CGG AGA CAC GTG GAC ATG ATG GTG GGC GCT GCT ACT 432 Val Glu Ser PheArg Arg His Val Asp Met Met Val Gly Ala Ala Thr 130 135 140 GTG TGC TCCGCT CTC TAT ATT GGG GAC TTG TGT GGT GGC GTA TTC TTG 480 Val Cys Ser AlaLeu Tyr Ile Gly Asp Leu Cys Gly Gly Val Phe Leu 145 150 155 160 GTT GGTCAG ATG TTT TCT TTC CGG CCA CGA CGC CAC TGG ACT ACG CAG 528 Val Gly GlnMet Phe Ser Phe Arg Pro Arg Arg His Trp Thr Thr Gln 165 170 175 GAC TGCAAT TGT TCC ATC TAC GCG GGG CAC ATC ACT GGC CAC GGA ATG 576 Asp Cys AsnCys Ser Ile Tyr Ala Gly His Ile Thr Gly His Gly Met 180 185 190 GCA 579Ala (2) INFORMATION FOR SEQ ID NO: 186: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 Pro Val Gly Gly Val Ala Arg Ala Leu Glu His Gly Val Arg Ala Val 20 2530 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 4045 Ser Ile Tyr Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 50 5560 Ala Ile His Tyr Arg Asn Ala Ser Gly Val Tyr His Val Thr Asn Asp 65 7075 80 Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Asp His His Ile Leu His 8590 95 Leu Pro Gly Cys Leu Pro Cys Val Arg Val Gly Asn Gln Ser Arg Cys100 105 110 Trp Val Ala Leu Ser Pro Thr Val Ala Ala Pro Tyr Ile Gly AlaPro 115 120 125 Val Glu Ser Phe Arg Arg His Val Asp Met Met Val Gly AlaAla Thr 130 135 140 Val Cys Ser Ala Leu Tyr Ile Gly Asp Leu Cys Gly GlyVal Phe Leu 145 150 155 160 Val Gly Gln Met Phe Ser Phe Arg Pro Arg ArgHis Trp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Ala Gly HisIle Thr Gly His Gly Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO:187: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:187: ACT TGC GGC TTT GCC GAC CTC ATG GGA TAC ATC CCG CTC GTA GGC GCC 48Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 CCT GTG GGT GGC GTC GCC AGG GCC CTG GCA CAC GGT GTT AGG GCT GTG 96Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30GAG GAC GGG ATC AAT TAT GCG ACA GGG AAT CTT CCC GGT TGC TCT TTC 144 GluAsp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TTC CTC TTG GCA CTT CTT TCG TGC CTG ACT GTT CCC ACC TCG 192 Ser IlePhe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 50 55 60 GCC GTCAAC TAT CGC AAT GCC TCG GGC ATC TAT CAC ATC ACC AAT GAC 240 Ala Val AsnTyr Arg Asn Ala Ser Gly Ile Tyr His Ile Thr Asn Asp 65 70 75 80 TGC CCGAAC TCG AGC ATA GTG TAC GAG ACC GAG CAC CAC ATC CTA CAC 288 Cys Pro AsnSer Ser Ile Val Tyr Glu Thr Glu His His Ile Leu His 85 90 95 CTC CCA GGGTGT TTA CCC TGC GTG AGG GTT GGG AAT CAG TCA CGC TGC 336 Leu Pro Gly CysLeu Pro Cys Val Arg Val Gly Asn Gln Ser Arg Cys 100 105 110 TGG GTG GCCCTC ACT CCC ACC GTG GCG GCG CCT TAC ATC GGC GCT CCG 384 Trp Val Ala LeuThr Pro Thr Val Ala Ala Pro Tyr Ile Gly Ala Pro 115 120 125 CTT GAA TCCCTC CGG AGT CAT GTG GAT CTG ATG GTA GGT GCC GCT ACT 432 Leu Glu Ser LeuArg Ser His Val Asp Leu Met Val Gly Ala Ala Thr 130 135 140 GCG TGC TCCGCT CTT TAC ATC GGA GAC CTG TGC GGT GGC GTA TTT TTG 480 Ala Cys Ser AlaLeu Tyr Ile Gly Asp Leu Cys Gly Gly Val Phe Leu 145 150 155 160 GTT GGTCAG ATG TTC TCT TTC CAG CCG CGG CGC CAC TGG ACT ACG CAG 528 Val Gly GlnMet Phe Ser Phe Gln Pro Arg Arg His Trp Thr Thr Gln 165 170 175 GAC TGCAAT TGT TCC ATC TAC GCG GGG CAC GTT ACG GGC CAC AGG ATG 576 Asp Cys AsnCys Ser Ile Tyr Ala Gly His Val Thr Gly His Arg Met 180 185 190 GCA 579Ala (2) INFORMATION FOR SEQ ID NO: 188: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188:Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 2530 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 4045 Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Thr Ser 50 5560 Ala Val Asn Tyr Arg Asn Ala Ser Gly Ile Tyr His Ile Thr Asn Asp 65 7075 80 Cys Pro Asn Ser Ser Ile Val Tyr Glu Thr Glu His His Ile Leu His 8590 95 Leu Pro Gly Cys Leu Pro Cys Val Arg Val Gly Asn Gln Ser Arg Cys100 105 110 Trp Val Ala Leu Thr Pro Thr Val Ala Ala Pro Tyr Ile Gly AlaPro 115 120 125 Leu Glu Ser Leu Arg Ser His Val Asp Leu Met Val Gly AlaAla Thr 130 135 140 Ala Cys Ser Ala Leu Tyr Ile Gly Asp Leu Cys Gly GlyVal Phe Leu 145 150 155 160 Val Gly Gln Met Phe Ser Phe Gln Pro Arg ArgHis Trp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Ala Gly HisVal Thr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO:189: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 579 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..579 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:189: ACG TGC GGC TTC GCC GAC CTC ATG GGA TAC ATC CCG CTC GTG GGC GCC 48Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 CCC GTT GGG GGC GTC GCC AGG GCC CTG GCG CAT GGC GTC AGG GCT GTG 96Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 25 30GAG GAC GGG ATT AAC TAT GCG ACA GGG AAT CTT CCC GGT TGC TCT TTC 144 GluAsp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 TCTATC TTC CTC CTG GCA CTT CTT TCG TGC CTC ACT GTC CCA GCG TCA 192 Ser IlePhe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60 GCT GAGCAC TAC CGG AAT GCT TCG GGC ATC TAT CAC ATC ACC AAT GAC 240 Ala Glu HisTyr Arg Asn Ala Ser Gly Ile Tyr His Ile Thr Asn Asp 65 70 75 80 TGT CCGAAT TCC AGC GTA GTC TAT GAA ACT GAC CAC CAT ATA TTG CAC 288 Cys Pro AsnSer Ser Val Val Tyr Glu Thr Asp His His Ile Leu His 85 90 95 TTG CCG GGGTGC GTA CCC TGC GTG AGG GCC GGG AAC GTG TCT CGT TGC 336 Leu Pro Gly CysVal Pro Cys Val Arg Ala Gly Asn Val Ser Arg Cys 100 105 110 TGG ACG CCGGTA ACA CCT ACG GTG GCT GCC GTA TCC ATG GAC GCT CCG 384 Trp Thr Pro ValThr Pro Thr Val Ala Ala Val Ser Met Asp Ala Pro 115 120 125 CTC GAG TCCTTC CGG CGG CAT GTG GAC CTA ATG GTA GGT GCG GCC ACC 432 Leu Glu Ser PheArg Arg His Val Asp Leu Met Val Gly Ala Ala Thr 130 135 140 GTG TGT TCTGTC CTC TAT GTT GGA GAC CTC TGT GGA GGT GCT TTC CTA 480 Val Cys Ser ValLeu Tyr Val Gly Asp Leu Cys Gly Gly Ala Phe Leu 145 150 155 160 GTG GGGCAG ATG TTC ACC TTC CAG CCG CGT CGC CAC TGG ACC ACG CAG 528 Val Gly GlnMet Phe Thr Phe Gln Pro Arg Arg His Trp Thr Thr Gln 165 170 175 GAT TGTAAT TGC TCC ATC TAT ACT GGC CAT ATC ACC GGC CAC AGG ATG 576 Asp Cys AsnCys Ser Ile Tyr Thr Gly His Ile Thr Gly His Arg Met 180 185 190 GCG 579Ala (2) INFORMATION FOR SEQ ID NO: 190: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 193 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:Thr Cys Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val Gly Ala 1 5 1015 Pro Val Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Ala Val 20 2530 Glu Asp Gly Ile Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe 35 4045 Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser 50 5560 Ala Glu His Tyr Arg Asn Ala Ser Gly Ile Tyr His Ile Thr Asn Asp 65 7075 80 Cys Pro Asn Ser Ser Val Val Tyr Glu Thr Asp His His Ile Leu His 8590 95 Leu Pro Gly Cys Val Pro Cys Val Arg Ala Gly Asn Val Ser Arg Cys100 105 110 Trp Thr Pro Val Thr Pro Thr Val Ala Ala Val Ser Met Asp AlaPro 115 120 125 Leu Glu Ser Phe Arg Arg His Val Asp Leu Met Val Gly AlaAla Thr 130 135 140 Val Cys Ser Val Leu Tyr Val Gly Asp Leu Cys Gly GlyAla Phe Leu 145 150 155 160 Val Gly Gln Met Phe Thr Phe Gln Pro Arg ArgHis Trp Thr Thr Gln 165 170 175 Asp Cys Asn Cys Ser Ile Tyr Thr Gly HisIle Thr Gly His Arg Met 180 185 190 Ala (2) INFORMATION FOR SEQ ID NO:191: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 289 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 1..289 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 1..286 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:191: ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC ACC AAC 48Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 1 5 1015 CGC CGC CCC ATG GAC GTT AAG TTC CCG GGC GGT GGC CAG ATC GTT GGT 96Arg Arg Pro Met Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30GGA GTT TAC TTG TTG CCG CGC AGG GGC CCC AGG TTG GGT GTG CGC GCG 144 GlyVal Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 35 40 45 ACTAGG AAG ACT TCG GAG CGG TCG CAA CCT CGT GGG AGA CGT CAG CCT 192 Thr ArgLys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 50 55 60 ATC CCCAAG GCA CGT CGA TCT GAG GGA AGG TCC TGG GCT CAG CCC GGG 240 Ile Pro LysAla Arg Arg Ser Glu Gly Arg Ser Trp Ala Gln Pro Gly 65 70 75 80 TAC CCATGG CCT CTT TAC GGT AAT GAG GGT TGT GGG TGG GCA GGA TGG G 289 Tyr ProTrp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 85 90 95 (2)INFORMATION FOR SEQ ID NO: 192: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 96 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: MetSer Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn 1 5 10 15Arg Arg Pro Met Asp Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly 20 25 30Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 35 40 45Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro 50 55 60Ile Pro Lys Ala Arg Arg Ser Glu Gly Arg Ser Trp Ala Gln Pro Gly 65 70 7580 Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp 85 9095 (2) INFORMATION FOR SEQ ID NO: 193: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 498 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO(iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION:1..498 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B) LOCATION: 1..495 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 193: ATG AGC ACG AAT CCT AAA CCT CAAAGA AAA ACC AAA CGT AAC ACC AAC 48 Met Ser Thr Asn Pro Lys Pro Gln ArgLys Thr Lys Arg Asn Thr Asn 1 5 10 15 CGC CGC CCT ATG GAC GTA AAG TTCCCG GGC GGT GGA CAG ATC GTT GGC 96 Arg Arg Pro Met Asp Val Lys Phe ProGly Gly Gly Gln Ile Val Gly 20 25 30 GGA GTT TAC TTG TTG CCG CGC AGG GGCCCC CGG TTG GGT GTG CGC GCG 144 Gly Val Tyr Leu Leu Pro Arg Arg Gly ProArg Leu Gly Val Arg Ala 35 40 45 ACT CGG AAG ACT TCG GAG CGG TCG CAA CCTCGT GGC AGG CGT CAA CCT 192 Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro ArgGly Arg Arg Gln Pro 50 55 60 ATC CCC AAG GCG CGC CGG TCC GAG GGC AGG TCCTGG GCG CAA GCC GGG 240 Ile Pro Lys Ala Arg Arg Ser Glu Gly Arg Ser TrpAla Gln Ala Gly 65 70 75 80 TAC CCC TGG CCC CTC TAT GGC AAT GAG GGC TGTGGG TGG GCA GGG TGG 288 Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys GlyTrp Ala Gly Trp 85 90 95 CTC CTG TCT CCT CGC GGC TCT CGG CCA TCT TGG GGCCCA AAT GAT CCC 336 Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly ProAsn Asp Pro 100 105 110 CGG CGG AGA TCG CGC AAT CTG GGT AAG GTC ATC GATACC CTG ACG TGC 384 Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp ThrLeu Thr Cys 115 120 125 GGC TTC GCC GAC CTC ATG GGA TAC ATC CCG CTC GTGGGC GCC CCC GTC 432 Gly Phe Ala Asp Leu Met Gly Tyr Ile Pro Leu Val GlyAla Pro Val 130 135 140 GGG GGC GTC GCC AGG GCC CTG GCG CAT GGC GTC AGGGCT GTG GAG GAC 480 Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg AlaVal Glu Asp 145 150 155 160 GGG ATT AAC TAT CGA CAG 498 Gly Ile Asn TyrArg Gln 165 (2) INFORMATION FOR SEQ ID NO: 194: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 166 amino acids (B) TYPE: amino acid (D)TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 194: Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg AsnThr Asn 1 5 10 15 Arg Arg Pro Met Asp Val Lys Phe Pro Gly Gly Gly GlnIle Val Gly 20 25 30 Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu GlyVal Arg Ala 35 40 45 Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly ArgArg Gln Pro 50 55 60 Ile Pro Lys Ala Arg Arg Ser Glu Gly Arg Ser Trp AlaGln Ala Gly 65 70 75 80 Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Cys GlyTrp Ala Gly Trp 85 90 95 Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp GlyPro Asn Asp Pro 100 105 110 Arg Arg Arg Ser Arg Asn Leu Gly Lys Val IleAsp Thr Leu Thr Cys 115 120 125 Gly Phe Ala Asp Leu Met Gly Tyr Ile ProLeu Val Gly Ala Pro Val 130 135 140 Gly Gly Val Ala Arg Ala Leu Ala HisGly Val Arg Ala Val Glu Asp 145 150 155 160 Gly Ile Asn Tyr Arg Gln 165(2) INFORMATION FOR SEQ ID NO: 195: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 579 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO(iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION:1..579 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B) LOCATION: 1..576 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 195: ACG TGC GGA TTC GCC GAC CTC GTGGGG TAC ATC CCG CTC GTA GGC GGC 48 Thr Cys Gly Phe Ala Asp Leu Val GlyTyr Ile Pro Leu Val Gly Gly 1 5 10 15 CCC GTT GGG GGC GTC GCA AGG GCTCTC GCA CAT GGT GTG AGG GTT CTT 96 Pro Val Gly Gly Val Ala Arg Ala LeuAla His Gly Val Arg Val Leu 20 25 30 GAG GAC GGG GTG AAT TAT GCA ACA GGGAAT CTG CCT GGT TGC TCT TTC 144 Glu Asp Gly Val Asn Tyr Ala Thr Gly AsnLeu Pro Gly Cys Ser Phe 35 40 45 TCT ATC TTC ATT CTT GCA CTT CTC TCG TGCCTC ACT GTC CCG GCC TCT 192 Ser Ile Phe Ile Leu Ala Leu Leu Ser Cys LeuThr Val Pro Ala Ser 50 55 60 GCA GTT CCC TAC CGA AAT GCC TCT GGG ATC TATCAT GTC ACC AAT GAT 240 Ala Val Pro Tyr Arg Asn Ala Ser Gly Ile Tyr HisVal Thr Asn Asp 65 70 75 80 TGC CCA AAC TCT TCC ATA GTC TAT GAG GCA GATGAT CTG ATC CTA CAC 288 Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Asp AspLeu Ile Leu His 85 90 95 GCA CCT GGC TGC GTG CCT TGT GTC AGG AAA GAT AATGTG AGT AGG TGC 336 Ala Pro Gly Cys Val Pro Cys Val Arg Lys Asp Asn ValSer Arg Cys 100 105 110 TGG GTC CAA ATT ACC CCC ACG CTG TCA GCC CCG AGCTTC GGA GCA GTC 384 Trp Val Gln Ile Thr Pro Thr Leu Ser Ala Pro Ser PheGly Ala Val 115 120 125 ACG GCT CCC CTT CGG AGA GCC GTT GAT TAC TTG GTGGGA GGG GCT GCC 432 Thr Ala Pro Leu Arg Arg Ala Val Asp Tyr Leu Val GlyGly Ala Ala 130 135 140 CTC TGC TCC GCG TTA TAC GTT GGA GAC GCG TGT GGGGCA CTA TTT TTG 480 Leu Cys Ser Ala Leu Tyr Val Gly Asp Ala Cys Gly AlaLeu Phe Leu 145 150 155 160 GTA GGC CAA ATG TTC ACC TAT AGG CCT CGC CAGCAT GCT ACG GTG CAG 528 Val Gly Gln Met Phe Thr Tyr Arg Pro Arg Gln HisAla Thr Val Gln 165 170 175 GAC TGC AAC TGT TCC ATC TAC AGT GGC CAC GTCACC GGC CAT CAG ATG 576 Asp Cys Asn Cys Ser Ile Tyr Ser Gly His Val ThrGly His Gln Met 180 185 190 GCA 579 Ala (2) INFORMATION FOR SEQ ID NO:196: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 193 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 196: Thr Cys Gly Phe Ala Asp Leu ValGly Tyr Ile Pro Leu Val Gly Gly 1 5 10 15 Pro Val Gly Gly Val Ala ArgAla Leu Ala His Gly Val Arg Val Leu 20 25 30 Glu Asp Gly Val Asn Tyr AlaThr Gly Asn Leu Pro Gly Cys Ser Phe 35 40 45 Ser Ile Phe Ile Leu Ala LeuLeu Ser Cys Leu Thr Val Pro Ala Ser 50 55 60 Ala Val Pro Tyr Arg Asn AlaSer Gly Ile Tyr His Val Thr Asn Asp 65 70 75 80 Cys Pro Asn Ser Ser IleVal Tyr Glu Ala Asp Asp Leu Ile Leu His 85 90 95 Ala Pro Gly Cys Val ProCys Val Arg Lys Asp Asn Val Ser Arg Cys 100 105 110 Trp Val Gln Ile ThrPro Thr Leu Ser Ala Pro Ser Phe Gly Ala Val 115 120 125 Thr Ala Pro LeuArg Arg Ala Val Asp Tyr Leu Val Gly Gly Ala Ala 130 135 140 Leu Cys SerAla Leu Tyr Val Gly Asp Ala Cys Gly Ala Leu Phe Leu 145 150 155 160 ValGly Gln Met Phe Thr Tyr Arg Pro Arg Gln His Ala Thr Val Gln 165 170 175Asp Cys Asn Cys Ser Ile Tyr Ser Gly His Val Thr Gly His Gln Met 180 185190 Ala (2) INFORMATION FOR SEQ ID NO: 197: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 1485 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (ix)FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..1485 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 197: TGTGCCAGGA CCATCACCAC CGGAGCTTCT ATCACATACTCCACTTACGG CAAGTTCCTT 60 GCTGATGGAG GGTGTTCAGG CGGCGCGCAT GACGTGATCATATGCGACGA GTGCCATTCC 120 CAGGACGCCA CCACCATTCT TGGGATAGGC ACTGTCCTTGACCAGGCAGA GACGGCTGGA 180 GCTAGGCTCG TCGTCTTGGC CACGGCCACC CCTCCCGGCAGTGTGACAAC GCCCCACCCC 240 AACATCGAGG AAGTGGCCCT GCCTCAGGAG GGGGAGGTTCCCTTCTACGG CAGAGCCATT 300 CCCCTTGCTT TTATAAAGGG TGGTAGGCAT CTCATCTTCTGCCATTCCAA GAAAAAATGT 360 GATGAACTCG CCAAGCAACT GACCAGCCTG GGCGTGAACGCCGTGGCATA TTATAGAGGT 420 CTAGACGTCG CCGTCATACC CACAACAGGA GACGTGGTCGTGTGCAGCAC CGACGCGCTC 480 ATGACGGGAT TCACCGGCGA CTTTGATTCT GTCATAGACTGCAACTCCGC CGTCACTCAG 540 ACGGTGGACT TCAGTCTGGA TCCCACTTTT ACCATTGAGACTACCACAGT GCCCCAGGAC 600 GCAGTGTCCA GAAGCCAGCG TTGGGGCCGC ACGGGGAGAGGTAGGCACGG CATATACCGG 660 TATGTCTCGG CTGGAGAGAG ACCGTCTGGC ATGTTCGACTCCGTGGTGCT CTGTGAGTGC 720 TACGATGCCG GATGTGCATG GTACGATCTG ACTCCTGCCGAGACTACCGT GAGGTTGCGC 780 GCTTACNTAA ACACCCCCGG GCTCCCTGTC TGTCAGGACCATTTGGAATT CTGGGAGGGG 840 GTGTTCACGG GGCTCACTAA CATCGACGCT CACATGCTGTCACAGACCAA ACAGGGTGGG 900 GAGAATTTCC CATACCTTGT AGCGTACCAA GCAACAGTCTGTGTTCGCGC GAAAGCGCCC 960 CCCCCCAGCT GGGACACAAT GTGGAAATGC ATGCTCCGTCTCAAACCGAC NTTAACTGGC 1020 CCTACTCCCC TCTTGTACAG GCTGGGGCCC GTCCAGAATGAGATCACACT GACGCACCCC 1080 ATCACCAAGT ACATTATGGC TTGCATGTCT GCGGACTTGGAGGTCATTAC CAGCACTTGG 1140 GTTCTGGTGG GGGGCGTTGT GGCGGCCCTG GCGGCCTACTGCTTGACGGT GGGTTCGGTA 1200 GCCATAGTCG GTAGGATCAT CCTCTCTGGG AAACCTGCCATCATTCCCGA TAGGGAGGTA 1260 TTATACCAGC AATTTGATGA GATGGAGGAG TGCTCGGCCTCGTTGCCCTA TATGGACGAA 1320 ACACGTGCCA TTGCCGGACA ATTCAAAGAG AAAGTGCTCGGCTTCATCAG CACGACCGGC 1380 CAGAAGGCTG AAACTCTGAA GCCGGCAGCC ACGTCTGTGTGGAACAAGGC TGAGCAGTTC 1440 TGGNCCACAT ACATGTGGAA CTTCATCAGT GGGATACAATAATAG 1485 (2) INFORMATION FOR SEQ ID NO: 198: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 484 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: Cys Ala Arg Thr Ile Thr ThrGly Ala Ser Ile Thr Tyr Ser Thr Tyr 1 5 10 15 Gly Lys Phe Leu Ala AspGly Gly Cys Ser Gly Gly Ala His Asp Val 20 25 30 Ile Ile Cys Asp Glu CysHis Ser Gln Asp Ala Thr Thr Ile Leu Gly 35 40 45 Ile Gly Thr Val Leu AspGln Ala Glu Thr Ala Gly Ala Arg Leu Val 50 55 60 Val Leu Ala Thr Ala ThrPro Pro Gly Ser Val Thr Thr Pro His Pro 65 70 75 80 Asn Ile Glu Glu ValAla Leu Pro Gln Glu Gly Glu Val Pro Phe Tyr 85 90 95 Gly Arg Ala Ile ProLeu Ala Phe Ile Lys Gly Gly Arg His Leu Ile 100 105 110 Phe Cys His SerLys Lys Lys Cys Asp Glu Leu Ala Lys Gln Leu Thr 115 120 125 Ser Leu GlyVal Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ala 130 135 140 Val IlePro Thr Thr Gly Asp Val Val Val Cys Ser Thr Asp Ala Leu 145 150 155 160Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Ser 165 170175 Ala Val Thr Gln Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile 180185 190 Glu Thr Thr Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Trp195 200 205 Gly Arg Thr Gly Arg Gly Arg His Gly Ile Tyr Arg Tyr Val SerAla 210 215 220 Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Val Val Leu CysGlu Cys 225 230 235 240 Tyr Asp Ala Gly Cys Ala Trp Tyr Asp Leu Thr ProAla Glu Thr Thr 245 250 255 Val Arg Leu Arg Ala Tyr Xaa Asn Thr Pro GlyLeu Pro Val Cys Gln 260 265 270 Asp His Leu Glu Phe Trp Glu Gly Val PheThr Gly Leu Thr Asn Ile 275 280 285 Asp Ala His Met Leu Ser Gln Thr LysGln Gly Gly Glu Asn Phe Pro 290 295 300 Tyr Leu Val Ala Tyr Gln Ala ThrVal Cys Val Arg Ala Lys Ala Pro 305 310 315 320 Pro Pro Ser Trp Asp ThrMet Trp Lys Cys Met Leu Arg Leu Lys Pro 325 330 335 Xaa Leu Thr Gly ProThr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gln 340 345 350 Asn Glu Ile ThrLeu Thr His Pro Ile Thr Lys Tyr Ile Met Ala Cys 355 360 365 Met Ser AlaAsp Leu Glu Val Ile Thr Ser Thr Trp Val Leu Val Gly 370 375 380 Gly ValVal Ala Ala Leu Ala Ala Tyr Cys Leu Thr Val Gly Ser Val 385 390 395 400Ala Ile Val Gly Arg Ile Ile Leu Ser Gly Lys Pro Ala Ile Ile Pro 405 410415 Asp Arg Glu Val Leu Tyr Gln Gln Phe Asp Glu Met Glu Glu Cys Ser 420425 430 Ala Ser Leu Pro Tyr Met Asp Glu Thr Arg Ala Ile Ala Gly Gln Phe435 440 445 Lys Glu Lys Val Leu Gly Phe Ile Ser Thr Thr Gly Gln Lys AlaGlu 450 455 460 Thr Leu Lys Pro Ala Ala Thr Ser Val Trp Asn Lys Ala GluGln Phe 465 470 475 480 Trp Xaa Thr Tyr (2) INFORMATION FOR SEQ ID NO:199: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1485 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..1485 (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 199: TGTGCCAGGA CCATCACCAC CGGAGCTTCTATCACATACT CCACTTACGG CAAGTTCCTT 60 GCTGATGGAG GGTGTTCAGG CGGCGCGTATGACGTGATCA TATGCGACGA GTGCCATTCC 120 CAGGACGCCA CCACCATTCT TGGGATAGGCACTGTCCTTG ACCAGGCAGA GACGGCTGGA 180 GCTAGGCTCG TCGTCTTGGC CACGGCCACCCCTCCCGGCA GTGTGACAAC GCCCCACCCC 240 AACATCGAGG AAGTGGCCCT GCCTCAGGAGGGGGAGGTTC CCTTCTACGG CAGAGCCATT 300 CCCCTTGCTT TTATAAAGGG TGGTAGGCATCTCATCTTCT GCCATTCCAA GAAAAAATGT 360 GATGAACTCG CCAAGCAACT GACCAGCCTGGGCGTGAACG CCGTGGCATA TTATAGAGGT 420 CTAGACGTCG CCGTCATCCC CACAGCAGGAGACGTGGTCG TGTGCAGCAC CGACGCGCTC 480 ATGACGGGAT TCACCGGCGA CTTTGATTCTGTCATAGACT GCAACTCCGC CGTCACTCAG 540 ACGGTGGACT TCAGTCTGGA TCCCACTTTTACCATTGAGA CTACCACAGT GCCCCAGGAC 600 GCAGTGTCCA GAAGCCAGCG TAGGGGCCGCACGGGGAGAG GTAGGCACGG CATATACCGG 660 TATGTCTCGG CTGGAGAGAG ACCNTCTGACATGTTCGACT CCGTGGTGCT CTGTGAGTGC 720 TACGATGCCG GATGTGCGTG GTATGATCTGACTCCTGCCG AGACTACCGT GAGGTTGCGC 780 GCTTACATAA ACACCCCCGG GCTCCCTGTCTGTCAGGACC ATTTGGAATT CTGGGAGGGG 840 GTGTTCACGG GGCTCACTAA CATCGACGCTCACATGCTGT CACAGACCAA ACAGGGTGGG 900 GAGAATTTNC CATACCTTGT AGCGTACCAAGCAACAGTCT GTGTTCGCGC GAAAGCGCCC 960 CCCCCCAGCT GGGACACAAT GTGGAAATGCATGCTCCGTC TCAAACCGAC TTTAACTGGC 1020 CCTACTCCCC TCTTGTACAG GCTGGGGCCCGTCCAGANTG AGATCACACT GACGCACCCC 1080 ATCACCAAGT ACATTATGGC TTGCATGTCTGCGGACTTGG AGGTCATTAC CANCACTTGG 1140 GTTCTGGTGG GGGGCGTTGT GGCGGCCCTGGCGGCCTACT GCTTGACGGT GGGTTCGGTA 1200 GCCATAGTCG GTAGGATCAT CCTCTCTGGGAAACCTGCCA TCATTCCCGA TAGGGAGGCA 1260 TTATACCAGC AATTTGATGA GATGGAGGAGTGCTCGGCCT CGTTGCCCTA TATGGACGAG 1320 ACACGTGCCA TTGCCGGACA ATTCAAAGAGAAAGTGCTCG GCTTCATCAG CACGACCGGC 1380 CAGAAGGCTG AAACTCTGAA GCCGGCAGCCACGTCTGTGT GGAACAAGGC TGAGCAGTTC 1440 TGGGCCACAT ACATGTGGAA CTTCATCAGCGGGATACAAT AATAG 1485 (2) INFORMATION FOR SEQ ID NO: 200: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 484 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: Cys Ala Arg Thr Ile Thr ThrGly Ala Ser Ile Thr Tyr Ser Thr Tyr 1 5 10 15 Gly Lys Phe Leu Ala AspGly Gly Cys Ser Gly Gly Ala Tyr Asp Val 20 25 30 Ile Ile Cys Asp Glu CysHis Ser Gln Asp Ala Thr Thr Ile Leu Gly 35 40 45 Ile Gly Thr Val Leu AspGln Ala Glu Thr Ala Gly Ala Arg Leu Val 50 55 60 Val Leu Ala Thr Ala ThrPro Pro Gly Ser Val Thr Thr Pro His Pro 65 70 75 80 Asn Ile Glu Glu ValAla Leu Pro Gln Glu Gly Glu Val Pro Phe Tyr 85 90 95 Gly Arg Ala Ile ProLeu Ala Phe Ile Lys Gly Gly Arg His Leu Ile 100 105 110 Phe Cys His SerLys Lys Lys Cys Asp Glu Leu Ala Lys Gln Leu Thr 115 120 125 Ser Leu GlyVal Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ala 130 135 140 Val IlePro Thr Ala Gly Asp Val Val Val Cys Ser Thr Asp Ala Leu 145 150 155 160Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Ser 165 170175 Ala Val Thr Gln Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile 180185 190 Glu Thr Thr Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Arg195 200 205 Gly Arg Thr Gly Arg Gly Arg His Gly Ile Tyr Arg Tyr Val SerAla 210 215 220 Gly Glu Arg Xaa Ser Asp Met Phe Asp Ser Val Val Leu CysGlu Cys 225 230 235 240 Tyr Asp Ala Gly Cys Ala Trp Tyr Asp Leu Thr ProAla Glu Thr Thr 245 250 255 Val Arg Leu Arg Ala Tyr Ile Asn Thr Pro GlyLeu Pro Val Cys Gln 260 265 270 Asp His Leu Glu Phe Trp Glu Gly Val PheThr Gly Leu Thr Asn Ile 275 280 285 Asp Ala His Met Leu Ser Gln Thr LysGln Gly Gly Glu Asn Xaa Pro 290 295 300 Tyr Leu Val Ala Tyr Gln Ala ThrVal Cys Val Arg Ala Lys Ala Pro 305 310 315 320 Pro Pro Ser Trp Asp ThrMet Trp Lys Cys Met Leu Arg Leu Lys Pro 325 330 335 Thr Leu Thr Gly ProThr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gln 340 345 350 Xaa Glu Ile ThrLeu Thr His Pro Ile Thr Lys Tyr Ile Met Ala Cys 355 360 365 Met Ser AlaAsp Leu Glu Val Ile Thr Xaa Thr Trp Val Leu Val Gly 370 375 380 Gly ValVal Ala Ala Leu Ala Ala Tyr Cys Leu Thr Val Gly Ser Val 385 390 395 400Ala Ile Val Gly Arg Ile Ile Leu Ser Gly Lys Pro Ala Ile Ile Pro 405 410415 Asp Arg Glu Ala Leu Tyr Gln Gln Phe Asp Glu Met Glu Glu Cys Ser 420425 430 Ala Ser Leu Pro Tyr Met Asp Glu Thr Arg Ala Ile Ala Gly Gln Phe435 440 445 Lys Glu Lys Val Leu Gly Phe Ile Ser Thr Thr Gly Gln Lys AlaGlu 450 455 460 Thr Leu Lys Pro Ala Ala Thr Ser Val Trp Asn Lys Ala GluGln Phe 465 470 475 480 Trp Ala Thr Tyr (2) INFORMATION FOR SEQ ID NO:201: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A)NAME/KEY: CDS (B) LOCATION: 2..340 (ix) FEATURE: (A) NAME/KEY:mat_peptide (B) LOCATION: 2..337 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:201: C TCC ACT GTG ACT GAG AGA GAC ATC AGG GTC GAA GAA GAA GTC TAT 46Ser Thr Val Thr Glu Arg Asp Ile Arg Val Glu Glu Glu Val Tyr 1 5 10 15CAG TGT TGT GAT CTG GAG CCC GAG GCC CGC AAG GTA ATA ACC GCC CTC 94 GlnCys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val Ile Thr Ala Leu 20 25 30 ACGGAG AGA CTC TAC GTG GGC GGC CCT ATG TAC AAT AGC AAG GGA GAC 142 Thr GluArg Leu Tyr Val Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp 35 40 45 CTT TGCGGG TAT CGC AGG TGC CGC GCA AGC GGC GTA TAT ACC ACC AGC 190 Leu Cys GlyTyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser 50 55 60 TTC GGG AACACA CTG ACG TGC TAC CTT AAA GCC TCA GCA GCC ATC AGG 238 Phe Gly Asn ThrLeu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Ile Arg 65 70 75 GCT GCG GGG CTGAAG GAC TGC ACC ATG CTG GTT TGC GGT GAC GAC TTA 286 Ala Ala Gly Leu LysAsp Cys Thr Met Leu Val Cys Gly Asp Asp Leu 80 85 90 95 GTC GTG ATC GCTGAA AGC GGT GGC GTC GAG GAG GAC AAG CGA GCC CTC 334 Val Val Ile Ala GluSer Gly Gly Val Glu Glu Asp Lys Arg Ala Leu 100 105 110 GGA GCT 340 GlyAla (2) INFORMATION FOR SEQ ID NO: 202: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear(ii) MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:Ser Thr Val Thr Glu Arg Asp Ile Arg Val Glu Glu Glu Val Tyr Gln 1 5 1015 Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val Ile Thr Ala Leu Thr 20 2530 Glu Arg Leu Tyr Val Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp Leu 35 4045 Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 50 5560 Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Ile Arg Ala 65 7075 80 Ala Gly Leu Lys Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 8590 95 Val Ile Ala Glu Ser Gly Gly Val Glu Glu Asp Lys Arg Ala Leu Gly100 105 110 Ala (2) INFORMATION FOR SEQ ID NO: 203: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 2..340 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 2..337 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: C TCC ACAGTG ACT GAA AGA GAC ATC AGG GTC GAG GAA GAG GTC TAC 46 Ser Thr Val ThrGlu Arg Asp Ile Arg Val Glu Glu Glu Val Tyr 1 5 10 15 CAG TGT TGT GACCTG GAG CCT GAA ACC CGC AAG GTA ATA TCT GCC CTC 94 Gln Cys Cys Asp LeuGlu Pro Glu Thr Arg Lys Val Ile Ser Ala Leu 20 25 30 ACT GAA AGA CTC TATGTG GGC GGT CCC ATG CAC AAC AGC AGG GGA GAC 142 Thr Glu Arg Leu Tyr ValGly Gly Pro Met His Asn Ser Arg Gly Asp 35 40 45 CTA TGC GGG TAC CGT AGATGC CGC GCG AGC GGC GTA TAC ACC ACA AGC 190 Leu Cys Gly Tyr Arg Arg CysArg Ala Ser Gly Val Tyr Thr Thr Ser 50 55 60 TTC GGG AAC ACT CTG ACG TGCTTC CTC AAG GCC ACA GCG GCC ACC AAA 238 Phe Gly Asn Thr Leu Thr Cys PheLeu Lys Ala Thr Ala Ala Thr Lys 65 70 75 GCC GCT GGC CTA AAG GAC TGC ACCATG TTG GTG TGT GGT GAC GAC TTA 286 Ala Ala Gly Leu Lys Asp Cys Thr MetLeu Val Cys Gly Asp Asp Leu 80 85 90 95 GTC GTT ATC GCC GAA AGC GAT GGTGTC GAA GAG GAC CGC CGA GCC CTC 334 Val Val Ile Ala Glu Ser Asp Gly ValGlu Glu Asp Arg Arg Ala Leu 100 105 110 GGA GCT 340 Gly Ala (2)INFORMATION FOR SEQ ID NO: 204: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: SerThr Val Thr Glu Arg Asp Ile Arg Val Glu Glu Glu Val Tyr Gln 1 5 10 15Cys Cys Asp Leu Glu Pro Glu Thr Arg Lys Val Ile Ser Ala Leu Thr 20 25 30Glu Arg Leu Tyr Val Gly Gly Pro Met His Asn Ser Arg Gly Asp Leu 35 40 45Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 50 55 60Gly Asn Thr Leu Thr Cys Phe Leu Lys Ala Thr Ala Ala Thr Lys Ala 65 70 7580 Ala Gly Leu Lys Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 85 9095 Val Ile Ala Glu Ser Asp Gly Val Glu Glu Asp Arg Arg Ala Leu Gly 100105 110 Ala (2) INFORMATION FOR SEQ ID NO: 205: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 2..340 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 2..337 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: C TCC ACGGTG ACC GAA AGG GAT ATC AGG ACC GAG GAA GAG ATC TAC 46 Ser Thr Val ThrGlu Arg Asp Ile Arg Thr Glu Glu Glu Ile Tyr 1 5 10 15 CAG TGC TGC GACCTG GAG CCC GAA GCC CGC AAG GTG ATA TCC GCC CTA 94 Gln Cys Cys Asp LeuGlu Pro Glu Ala Arg Lys Val Ile Ser Ala Leu 20 25 30 ACG GAA AGA CTC TACGTG GGC GGT CCC ATG TAC AAC TCC AAG GGG GAC 142 Thr Glu Arg Leu Tyr ValGly Gly Pro Met Tyr Asn Ser Lys Gly Asp 35 40 45 CTA TGC GGG CAA CGG AGGTGC CGC GCA AGC GGG GTC TAC ACC ACC AGC 190 Leu Cys Gly Gln Arg Arg CysArg Ala Ser Gly Val Tyr Thr Thr Ser 50 55 60 TTC GGG AAC ACT GTA ACG TGTTAT CTC AAG GCC GTT GCG GCT ACT AGG 238 Phe Gly Asn Thr Val Thr Cys TyrLeu Lys Ala Val Ala Ala Thr Arg 65 70 75 GCC GCA GGT CTG AAA GGT TGC AGCATG CTG GTT TGT GGA GAC GAC TTA 286 Ala Ala Gly Leu Lys Gly Cys Ser MetLeu Val Cys Gly Asp Asp Leu 80 85 90 95 GTC GTC ATC TGC GAG AGC GGC GGCGTA GAG GAG GAT GCA AGA GCC CTC 334 Val Val Ile Cys Glu Ser Gly Gly ValGlu Glu Asp Ala Arg Ala Leu 100 105 110 CGA GCC 340 Arg Ala (2)INFORMATION FOR SEQ ID NO: 206: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: SerThr Val Thr Glu Arg Asp Ile Arg Thr Glu Glu Glu Ile Tyr Gln 1 5 10 15Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val Ile Ser Ala Leu Thr 20 25 30Glu Arg Leu Tyr Val Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp Leu 35 40 45Cys Gly Gln Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 50 55 60Gly Asn Thr Val Thr Cys Tyr Leu Lys Ala Val Ala Ala Thr Arg Ala 65 70 7580 Ala Gly Leu Lys Gly Cys Ser Met Leu Val Cys Gly Asp Asp Leu Val 85 9095 Val Ile Cys Glu Ser Gly Gly Val Glu Glu Asp Ala Arg Ala Leu Arg 100105 110 Ala (2) INFORMATION FOR SEQ ID NO: 207: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 2..340 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 2..337 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: C TCC ACGGTG ACT GAA AGG GAC ATT AGG GTC GAG GAA GAG ATC TAC 46 Ser Thr Val ThrGlu Arg Asp Ile Arg Val Glu Glu Glu Ile Tyr 1 5 10 15 CAG TGC TGT GACCTG GAG CCC GAG GCA CGC AAG GTG ATA TCC GCT CTC 94 Gln Cys Cys Asp LeuGlu Pro Glu Ala Arg Lys Val Ile Ser Ala Leu 20 25 30 ACA GAA AGA CTC TACAAG GGC GGC CCC ATG TAT AAC AGC AAG GGG GAC 142 Thr Glu Arg Leu Tyr LysGly Gly Pro Met Tyr Asn Ser Lys Gly Asp 35 40 45 CTA TGC GGG CTT CGG AGGTGC CGC GCA AGC GGG GTA TAC ACC ACA AGC 190 Leu Cys Gly Leu Arg Arg CysArg Ala Ser Gly Val Tyr Thr Thr Ser 50 55 60 TTC GGG AAC ACG GTG ACA TGCTAC CTT AAA GCC ACA GCA GCC ACC AGG 238 Phe Gly Asn Thr Val Thr Cys TyrLeu Lys Ala Thr Ala Ala Thr Arg 65 70 75 GCT GCA GGG CTG AAA GAT TGC ACTATG CTG GTA TGC GGT GAC GAC TTA 286 Ala Ala Gly Leu Lys Asp Cys Thr MetLeu Val Cys Gly Asp Asp Leu 80 85 90 95 GTC GTT ATT GCC GAA AGC GGT GGCGTG GAG GAG GAC GCC CGA GCC CTC 334 Val Val Ile Ala Glu Ser Gly Gly ValGlu Glu Asp Ala Arg Ala Leu 100 105 110 CGA GCC 340 Arg Ala (2)INFORMATION FOR SEQ ID NO: 208: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: SerThr Val Thr Glu Arg Asp Ile Arg Val Glu Glu Glu Ile Tyr Gln 1 5 10 15Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val Ile Ser Ala Leu Thr 20 25 30Glu Arg Leu Tyr Lys Gly Gly Pro Met Tyr Asn Ser Lys Gly Asp Leu 35 40 45Cys Gly Leu Arg Arg Cys Arg Ala Ser Gly Val Tyr Thr Thr Ser Phe 50 55 60Gly Asn Thr Val Thr Cys Tyr Leu Lys Ala Thr Ala Ala Thr Arg Ala 65 70 7580 Ala Gly Leu Lys Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 85 9095 Val Ile Ala Glu Ser Gly Gly Val Glu Glu Asp Ala Arg Ala Leu Arg 100105 110 Ala (2) INFORMATION FOR SEQ ID NO: 209: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 1..340 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:CCCCACCGTG ACNGAGAGGG ACNTCAGGGT CGAGGAAGAG GTCTATCAGT GCTGTAATCT 60GGAGNCCGAT GNCCGCAAGG TCATCAACGC CCTCACAGAG AGACTCTACG TGGGCGGCCC 120TATGCACAAC AGCAAGGGAG ACCTGTGTGG CATCCGTAGA TGCCGCGCGA GCGGCGTTTA 180CACCACGAGC TTCGGAAACA CGCTGACTTG CTACCTCAAA GCCACAGCGG CCACCAGGGC 240CGCGGGCTTG AAGGATTGCA CCATGCTGGT CTGCGGNGAC GACCTGGTTG TCATTGCTGA 300GAGCATTGGC ATAGACGAGG ACAAGCAAGC CCTCCGNACT 340 (2) INFORMATION FOR SEQID NO: 210: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 113 amino acids(B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 210: Pro Thr Val Thr Glu Arg Asp XaaArg Val Glu Glu Glu Val Tyr Gln 1 5 10 15 Cys Cys Asn Leu Glu Xaa AspXaa Arg Lys Val Ile Asn Ala Leu Thr 20 25 30 Glu Arg Leu Tyr Val Gly GlyPro Met His Asn Ser Lys Gly Asp Leu 35 40 45 Cys Gly Ile Arg Arg Cys ArgAla Ser Gly Val Tyr Thr Thr Ser Phe 50 55 60 Gly Asn Thr Leu Thr Cys TyrLeu Lys Ala Thr Ala Ala Thr Arg Ala 65 70 75 80 Ala Gly Leu Lys Asp CysThr Met Leu Val Cys Gly Asp Asp Leu Val 85 90 95 Val Ile Ala Glu Ser IleGly Ile Asp Glu Asp Lys Gln Ala Leu Arg 100 105 110 Thr (2) INFORMATIONFOR SEQ ID NO: 211: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 340 basepairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY:linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE:NO (ix) FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..340 (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 211: CTCGACTGTG NCCGAGAGGG ACATCAGGAC AGAGGGAGAGGTCTATCAGT GTTGCGACCT 60 GGAACCGGAA GCCCGCAAGG TAATCACCGC CCTCACTGAGAGACTCTATG TGGGCGGACC 120 CATGTTCAAC AGCAAGGGAG ACCTGTGCGG ACAACGCCGGTGCCGCGCAA GCGGCGTGTT 180 CACCACCAGC TTCGGGAACA CACTGACGTG CTACCTTAAAGCCACAGCTG CTACTAGAGC 240 AGCCGGCTTA AAAGATTGCA CCATGCTGGT CTGCGGTGACGACTTAGTCG TTATTTCCGA 300 GAGCGCCGGT GTGGAGGAGG ATCCCANAAC CCNNCGACCN340 (2) INFORMATION FOR SEQ ID NO: 212: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 113 amino acids (B) TYPE: amino acid (C) STRANDEDNESS:single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL:NO (iii) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: SerThr Val Xaa Glu Arg Asp Ile Arg Thr Glu Gly Glu Val Tyr Gln 1 5 10 15Cys Cys Asp Leu Glu Pro Glu Ala Arg Lys Val Ile Thr Ala Leu Thr 20 25 30Glu Arg Leu Tyr Val Gly Gly Pro Met Phe Asn Ser Lys Gly Asp Leu 35 40 45Cys Gly Gln Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser Phe 50 55 60Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Thr Ala Ala Thr Arg Ala 65 70 7580 Ala Gly Leu Lys Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 85 9095 Val Ile Ser Glu Ser Ala Gly Val Glu Glu Asp Pro Xaa Thr Xaa Arg 100105 110 Pro (2) INFORMATION FOR SEQ ID NO: 213: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 2..340 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 2..337 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: C TCA ACAGTC ACC GAG AAC GAC ATC CGT GTT GAG GAG TCA ATT TAC 46 Ser Thr Val ThrGlu Asn Asp Ile Arg Val Glu Glu Ser Ile Tyr 1 5 10 15 CAA TGT TGT GACTTG GCC CCC GAG GCC AGA CAG GCC ATA AAG TCG CTC 94 Gln Cys Cys Asp LeuAla Pro Glu Ala Arg Gln Ala Ile Lys Ser Leu 20 25 30 ACA GAG CGG CTT TATATC GGG GGT CCC CTG ACT AAT TCA AAG GGG CAG 142 Thr Glu Arg Leu Tyr IleGly Gly Pro Leu Thr Asn Ser Lys Gly Gln 35 40 45 AAC TGT GGC TAT CGC CGATGC CGC GCA AGC GGC GTG CTG ACG ACC AGC 190 Asn Cys Gly Tyr Arg Arg CysArg Ala Ser Gly Val Leu Thr Thr Ser 50 55 60 TGC GGT AAT ACC CTT ACA TGTTAC CTA AAG GCC TCT GCA GCC TGT CGA 238 Cys Gly Asn Thr Leu Thr Cys TyrLeu Lys Ala Ser Ala Ala Cys Arg 65 70 75 GCT GCG AAG CTC CAG GAC TGC ACGATG CTC GTG TGC GGG GAC GAC CTT 286 Ala Ala Lys Leu Gln Asp Cys Thr MetLeu Val Cys Gly Asp Asp Leu 80 85 90 95 GTC GTT ATC TGT GAA AGC GCG GGAACC CAA GAG GAC GCG GCG AGC CTA 334 Val Val Ile Cys Glu Ser Ala Gly ThrGln Glu Asp Ala Ala Ser Leu 100 105 110 CGA GTC 340 Arg Val (2)INFORMATION FOR SEQ ID NO: 214: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: SerThr Val Thr Glu Asn Asp Ile Arg Val Glu Glu Ser Ile Tyr Gln 1 5 10 15Cys Cys Asp Leu Ala Pro Glu Ala Arg Gln Ala Ile Lys Ser Leu Thr 20 25 30Glu Arg Leu Tyr Ile Gly Gly Pro Leu Thr Asn Ser Lys Gly Gln Asn 35 40 45Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys 50 55 60Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys Arg Ala 65 70 7580 Ala Lys Leu Gln Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val 85 9095 Val Ile Cys Glu Ser Ala Gly Thr Gln Glu Asp Ala Ala Ser Leu Arg 100105 110 Val (2) INFORMATION FOR SEQ ID NO: 215: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 2..340 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 2..340 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: C TCA ACCGTC ACG GAG AGG GAT ATA AGA ACA GAA GAA TCC ATA TAT 46 Ser Thr Val ThrGlu Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr 1 5 10 15 CAA GCT TGT TCCCTG CCC CAA GAG GCC AGA ACT GTC ATA CAC TCG CTC 94 Gln Ala Cys Ser LeuPro Gln Glu Ala Arg Thr Val Ile His Ser Leu 20 25 30 ACC GAG AGA CTC TACGTG GGA GGG CCC ATG ATA AAC AGC AAA GGG CAA 142 Thr Glu Arg Leu Tyr ValGly Gly Pro Met Ile Asn Ser Lys Gly Gln 35 40 45 TCC TGC GGT TAC AGG CGTTGC CGC GCA AGC GGT GTT TTC ACC ACC AGC 190 Ser Cys Gly Tyr Arg Arg CysArg Ala Ser Gly Val Phe Thr Thr Ser 50 55 60 ATG GGG AAT ACC ATG ACG TGTTAC ATC AAA GCC CTT GCA GCG TGT AAA 238 Met Gly Asn Thr Met Thr Cys TyrIle Lys Ala Leu Ala Ala Cys Lys 65 70 75 GCC GCA GGG ATC GTG GAC CCC GTCATG CTG GTG TGT GGA GAC GAC CTG 286 Ala Ala Gly Ile Val Asp Pro Val MetLeu Val Cys Gly Asp Asp Leu 80 85 90 95 GTC GTC ATC TCG GAG AGC CAG GGTAAC GAG GAG GAC GAG CGA AAC CTG 334 Val Val Ile Ser Glu Ser Gln Gly AsnGlu Glu Asp Glu Arg Asn Leu 100 105 110 AGA GCT 340 Arg Ala (2)INFORMATION FOR SEQ ID NO: 216: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: SerThr Val Thr Glu Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr Gln 1 5 10 15Ala Cys Ser Leu Pro Gln Glu Ala Arg Thr Val Ile His Ser Leu Thr 20 25 30Glu Arg Leu Tyr Val Gly Gly Pro Met Ile Asn Ser Lys Gly Gln Ser 35 40 45Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Phe Thr Thr Ser Met 50 55 60Gly Asn Thr Met Thr Cys Tyr Ile Lys Ala Leu Ala Ala Cys Lys Ala 65 70 7580 Ala Gly Ile Val Asp Pro Val Met Leu Val Cys Gly Asp Asp Leu Val 85 9095 Val Ile Ser Glu Ser Gln Gly Asn Glu Glu Asp Glu Arg Asn Leu Arg 100105 110 Ala (2) INFORMATION FOR SEQ ID NO: 217: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 340 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 2..340 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 2..340 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: C TCG ACTGTC ACT GAA CAG GAC ATC AGG GTG GAA GAG GAG ATA TAT 46 Ser Thr Val ThrGlu Gln Asp Ile Arg Val Glu Glu Glu Ile Tyr 1 5 10 15 CAA TGC TGC AACCTT GAA CCG GAG GCC AGG AAA GTG ATC TCC TCC CTC 94 Gln Cys Cys Asn LeuGlu Pro Glu Ala Arg Lys Val Ile Ser Ser Leu 20 25 30 ACG GAG CGG CTT TACTGC GGA GGC CCT ATG TTT AAC AGC AAG GGG GCC 142 Thr Glu Arg Leu Tyr CysGly Gly Pro Met Phe Asn Ser Lys Gly Ala 35 40 45 CAG TGT GGT TAT CGC CGTTGC CGT GCC AGT GGA GTT CTG CCT ACC AGC 190 Gln Cys Gly Tyr Arg Arg CysArg Ala Ser Gly Val Leu Pro Thr Ser 50 55 60 TTT GGC AAC ACA ATC ACT TGTTAC ATC AAG GCC ACA ACG GCC GCG AAG 238 Phe Gly Asn Thr Ile Thr Cys TyrIle Lys Ala Thr Thr Ala Ala Lys 65 70 75 GCC GCA GGC CTC CGG AAC CCG GACTTT CTT GTC TGC GGA GAT GAT CTG 286 Ala Ala Gly Leu Arg Asn Pro Asp PheLeu Val Cys Gly Asp Asp Leu 80 85 90 95 GTC GTG GTG GCT GAG AGT GAT GGCGTC GAC GAG GAT AGA GCA GCC CTG 334 Val Val Val Ala Glu Ser Asp Gly ValAsp Glu Asp Arg Ala Ala Leu 100 105 110 AGA GCC 340 Arg Ala (2)INFORMATION FOR SEQ ID NO: 218: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 113 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: SerThr Val Thr Glu Gln Asp Ile Arg Val Glu Glu Glu Ile Tyr Gln 1 5 10 15Cys Cys Asn Leu Glu Pro Glu Ala Arg Lys Val Ile Ser Ser Leu Thr 20 25 30Glu Arg Leu Tyr Cys Gly Gly Pro Met Phe Asn Ser Lys Gly Ala Gln 35 40 45Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Pro Thr Ser Phe 50 55 60Gly Asn Thr Ile Thr Cys Tyr Ile Lys Ala Thr Thr Ala Ala Lys Ala 65 70 7580 Ala Gly Leu Arg Asn Pro Asp Phe Leu Val Cys Gly Asp Asp Leu Val 85 9095 Val Val Ala Glu Ser Asp Gly Val Asp Glu Asp Arg Ala Ala Leu Arg 100105 110 Ala (2) INFORMATION FOR SEQ ID NO: 219: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 10 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: Arg Ser Glu Gly Arg Thr SerTrp Ala Gln 1 5 10 (2) INFORMATION FOR SEQ ID NO: 220: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 10 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: Arg Ser Glu Gly Arg Thr SerTrp Ala Gln 1 5 10 (2) INFORMATION FOR SEQ ID NO: 221: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 10 amino acids (B) TYPE: amino acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: Arg Thr Glu Gly Arg Thr SerTrp Ala Gln 1 5 10 (2) INFORMATION FOR SEQ ID NO: 222: (i) SEQUENCECHARACTERISTICS: (A) LENGTH: 629 base pairs (B) TYPE: nucleic acid (C)STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii)HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix) FEATURE: (A) NAME/KEY: CDS(B) LOCATION: 3..629 (ix) FEATURE: (A) NAME/KEY: mat_peptide (B)LOCATION: 3..629 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: TA GAC TTTTGG GAG AGC GTC TTC ACT GGA CTA ACT CAC ATA GAT GCC 47 Asp Phe Trp GluSer Val Phe Thr Gly Leu Thr His Ile Asp Ala 1 5 10 15 CAC TTT CTG TCACAG ACT AAG CAG CAG GGA CTC AAC TTC TCG TTC CTG 95 His Phe Leu Ser GlnThr Lys Gln Gln Gly Leu Asn Phe Ser Phe Leu 20 25 30 ACT GCC TAC CAA GCCACT GTG TGC GCT CGC GCG CAG GCT CCT CCC CCA 143 Thr Ala Tyr Gln Ala ThrVal Cys Ala Arg Ala Gln Ala Pro Pro Pro 35 40 45 AGT TGG GAC GAG ATG TGGAAG TGT CTC GTA CGG CTT AAG CCA ACA CTA 191 Ser Trp Asp Glu Met Trp LysCys Leu Val Arg Leu Lys Pro Thr Leu 50 55 60 CAT GGA CCT ACG CCT CTT CTATAT CGG TTG GGG CCT GTC CAA AAT GAA 239 His Gly Pro Thr Pro Leu Leu TyrArg Leu Gly Pro Val Gln Asn Glu 65 70 75 ATC TGC TTG ACA CAC CCC ATC ACAAAA TAC ATC ATG GCA TGC ATG TCA 287 Ile Cys Leu Thr His Pro Ile Thr LysTyr Ile Met Ala Cys Met Ser 80 85 90 95 GCT GAT CTG GAA GTA ACC ACC AGCACC TGG GTT TTG CTT GGA GGG GTC 335 Ala Asp Leu Glu Val Thr Thr Ser ThrTrp Val Leu Leu Gly Gly Val 100 105 110 CTC GCG GCC CTA GCG GCC TAC TGCTTG TCA GTC GGT TGT GTT GTG ATT 383 Leu Ala Ala Leu Ala Ala Tyr Cys LeuSer Val Gly Cys Val Val Ile 115 120 125 GTG GGT CAT ATC GAG CTG GGG GGCAAG CCG GCA ATC GTT CCA GAC AAA 431 Val Gly His Ile Glu Leu Gly Gly LysPro Ala Ile Val Pro Asp Lys 130 135 140 GAG GTG TTG TAT CAA CAA TAC GATGAG ATG GAA GAG TGC TCA CAA GCT 479 Glu Val Leu Tyr Gln Gln Tyr Asp GluMet Glu Glu Cys Ser Gln Ala 145 150 155 GCC CCA TAT ATC GAA CAA GCT CAGGTA ATA GCT CAC CAG TTC AAG GAA 527 Ala Pro Tyr Ile Glu Gln Ala Gln ValIle Ala His Gln Phe Lys Glu 160 165 170 175 AAA GTC CTT GGA TTG CTG CAGCGA GCC ACC CAA CAA CAA GCT GTC ATT 575 Lys Val Leu Gly Leu Leu Gln ArgAla Thr Gln Gln Gln Ala Val Ile 180 185 190 GAG CCC ATA GTA ACT ACC AACTGG CAA AAG CTT GAG GCC TTT TGG CAC 623 Glu Pro Ile Val Thr Thr Asn TrpGln Lys Leu Glu Ala Phe Trp His 195 200 205 AAG CAT 629 Lys His (2)INFORMATION FOR SEQ ID NO: 223: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 209 amino acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii)MOLECULE TYPE: protein (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: AspPhe Trp Glu Ser Val Phe Thr Gly Leu Thr His Ile Asp Ala His 1 5 10 15Phe Leu Ser Gln Thr Lys Gln Gln Gly Leu Asn Phe Ser Phe Leu Thr 20 25 30Ala Tyr Gln Ala Thr Val Cys Ala Arg Ala Gln Ala Pro Pro Pro Ser 35 40 45Trp Asp Glu Met Trp Lys Cys Leu Val Arg Leu Lys Pro Thr Leu His 50 55 60Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gln Asn Glu Ile 65 70 7580 Cys Leu Thr His Pro Ile Thr Lys Tyr Ile Met Ala Cys Met Ser Ala 85 9095 Asp Leu Glu Val Thr Thr Ser Thr Trp Val Leu Leu Gly Gly Val Leu 100105 110 Ala Ala Leu Ala Ala Tyr Cys Leu Ser Val Gly Cys Val Val Ile Val115 120 125 Gly His Ile Glu Leu Gly Gly Lys Pro Ala Ile Val Pro Asp LysGlu 130 135 140 Val Leu Tyr Gln Gln Tyr Asp Glu Met Glu Glu Cys Ser GlnAla Ala 145 150 155 160 Pro Tyr Ile Glu Gln Ala Gln Val Ile Ala His GlnPhe Lys Glu Lys 165 170 175 Val Leu Gly Leu Leu Gln Arg Ala Thr Gln GlnGln Ala Val Ile Glu 180 185 190 Pro Ile Val Thr Thr Asn Trp Gln Lys LeuGlu Ala Phe Trp His Lys 195 200 205 His (2) INFORMATION FOR SEQ ID NO:224: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE:amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: peptide (ix) FEATURE: (A) NAME/KEY: Peptide (B) LOCATION: 2..12(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: Ile His Tyr Arg Asn Ala SerGly Ile Tyr His Ile 1 5 10 (2) INFORMATION FOR SEQ ID NO: 225: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: Val Asn Tyr Arg AsnAla Ser Gly Ile Tyr His Ile 1 5 10 (2) INFORMATION FOR SEQ ID NO: 226:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: Val Asn Tyr Arg AsnAla Ser Gly Val Tyr His Ile 1 5 10 (2) INFORMATION FOR SEQ ID NO: 227:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: Val Asn Tyr His AsnThr Ser Gly Ile Tyr His Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 228:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: Gln His Tyr Arg AsnAla Ser Gly Ile Tyr His Val 1 5 10 (2) INFORMATION FOR SEQ ID NO: 229:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: Gln His Tyr Arg AsnVal Ser Gly Ile Tyr His Val 1 5 10 (2) INFORMATION FOR SEQ ID NO: 230:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: Ile His Tyr Arg AsnAla Ser Asp Gly Tyr Tyr Ile 1 5 10 (2) INFORMATION FOR SEQ ID NO: 231:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: Leu Gln Val Lys AsnThr Ser Ser Ser Tyr Met Val 1 5 10 (2) INFORMATION FOR SEQ ID NO: 232:(i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: Val Trp Gln Leu ArgAla Ile Val Leu His Val 1 5 10 (2) INFORMATION FOR SEQ ID NO: 233: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: Val Tyr Glu Ala AspTyr His Ile Leu His Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 234: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: Val Tyr Glu Thr AspAsn His Ile Leu His Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 235: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: Val Tyr Glu Thr GluAsn His Ile Leu His Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 236: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: Val Phe Glu Thr ValHis His Ile Leu His Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 237: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: Val Phe Glu Thr GluHis His Ile Leu His Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 238: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: Val Phe Glu Thr AspHis His Ile Met His Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 239: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: Val Tyr Glu Thr GluAsn His Ile Leu His Leu 1 5 10 (2) INFORMATION FOR SEQ ID NO: 240: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 11 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: Val Tyr Glu Ala AspAla Leu Ile Leu His Ala 1 5 10 (2) INFORMATION FOR SEQ ID NO: 241: (i)SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 amino acids (B) TYPE: aminoacid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULE TYPE:peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: Val Gln Asp Gly AsnThr Ser Ala Cys Trp Thr Pro Val 1 5 10 (2) INFORMATION FOR SEQ ID NO:242: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 amino acids (B) TYPE:amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: Val Lys Thr GlyAsn Gln Ser Arg Cys Trp Val Ala Leu 1 5 10 (2) INFORMATION FOR SEQ IDNO: 243: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 amino acids (B)TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii)MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: ValLys Thr Gly Asn Gln Ser Arg Cys Trp Val Ala Leu 1 5 10 (2) INFORMATIONFOR SEQ ID NO: 244: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 aminoacids (B) TYPE: amino acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear(ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:Val Arg Thr Gly Asn Gln Ser Arg Cys Trp Val Ala Leu 1 5 10 (2)INFORMATION FOR SEQ ID NO: 245: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 13 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 245: Val Lys Thr Gly Asn Gln Ser Arg Cys Trp Ile Ala Leu 1 510 (2) INFORMATION FOR SEQ ID NO: 246: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 13 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 246: Val Lys Thr Gly Asn Gln Ser Arg Cys Trp Ile Ala Leu 1 510 (2) INFORMATION FOR SEQ ID NO: 247: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 13 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 247: Val Lys Thr Gly Asn Ser Val Arg Cys Trp Ile Pro Leu 1 510 (2) INFORMATION FOR SEQ ID NO: 248: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 13 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 248: Val Lys Thr Gly Asn Val Ser Arg Cys Trp Ile Ser Leu 1 510 (2) INFORMATION FOR SEQ ID NO: 249: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 13 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 249: Val Arg Lys Asp Asn Val Ser Arg Cys Trp Val Gln Ile 5 10(2) INFORMATION FOR SEQ ID NO: 250: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 250: Ala Pro Ser Phe Gly Ala Val Thr Ala Pro 1 5 10 (2)INFORMATION FOR SEQ ID NO: 251: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 251: Val Ser Gln Pro Gly Ala Leu Thr Lys Gly 1 5 10 (2)INFORMATION FOR SEQ ID NO: 252: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 252: Val Lys Tyr Val Gly Ala Thr Thr Ala Ser 1 5 10 (2)INFORMATION FOR SEQ ID NO: 253: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 253: Ala Pro Tyr Ile Gly Ala Pro Val Glu Ser 1 5 10 (2)INFORMATION FOR SEQ ID NO: 254: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 254: Ala Gln His Leu Asn Ala Pro Leu Glu Ser 1 5 10 (2)INFORMATION FOR SEQ ID NO: 255: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 255: Ser Pro Tyr Val Gly Ala Pro Leu Glu Pro 1 5 10 (2)INFORMATION FOR SEQ ID NO: 256: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 256: Ser Pro Tyr Ala Gly Ala Pro Leu Glu Pro 1 5 10 (2)INFORMATION FOR SEQ ID NO: 257: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 257: Ala Pro Tyr Leu Gly Ala Pro Leu Glu Ser 1 5 10 (2)INFORMATION FOR SEQ ID NO: 258: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 258: Ala Pro Tyr Leu Gly Ala Pro Leu Glu Ser 1 5 10 (2)INFORMATION FOR SEQ ID NO: 259: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 259: Ala Pro Tyr Val Gly Ala Pro Leu Glu Ser 1 5 10 (2)INFORMATION FOR SEQ ID NO: 260: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 11 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 260: Asn Val Pro Tyr Leu Gly Ala Pro Leu Thr Ser 1 5 10 (2)INFORMATION FOR SEQ ID NO: 261: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 261: Ala Pro His Leu Arg Ala Pro Leu Ser Ser 1 5 10 (2)INFORMATION FOR SEQ ID NO: 262: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 262: Ala Pro Tyr Leu Gly Ala Pro Leu Thr Ser 1 5 10 (2)INFORMATION FOR SEQ ID NO: 263: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 263: Arg Pro Arg Gln His Ala Thr Val Gln Asp 1 5 10 (2)INFORMATION FOR SEQ ID NO: 264: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 264: Ser Pro Gln His His Lys Phe Val Gln Asp 1 5 10 (2)INFORMATION FOR SEQ ID NO: 265: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 265: Arg Pro Arg Arg Leu Trp Thr Thr Gln Glu 1 5 10 (2)INFORMATION FOR SEQ ID NO: 266: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 10 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 266: Pro Pro Arg Ile His Glu Thr Thr Gln Asp 1 5 10 (2)INFORMATION FOR SEQ ID NO: 267: (i) SEQUENCE CHARACTERISTICS: (A)LENGTH: 14 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single (D)TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 267: Thr Ile Ser Tyr Ala Asn Gly Ser Gly Pro Ser Asp Asp Lys1 5 10 (2) INFORMATION FOR SEQ ID NO: 268: (i) SEQUENCE CHARACTERISTICS:(A) LENGTH: 19 amino acids (B) TYPE: amino acid (C) STRANDEDNESS: single(D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (xi) SEQUENCEDESCRIPTION: SEQ ID NO: 268: Ser Arg Arg Gln Pro Ile Pro Arg Ala Arg ArgThr Glu Gly Arg Ser 1 5 10 15 Trp Ala Gln (2) INFORMATION FOR SEQ ID NO:269: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 1443 base pairs (B) TYPE:nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: linear (ii) MOLECULETYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iii) ANTI-SENSE: NO (ix)FEATURE: (A) NAME/KEY: CDS (B) LOCATION: 1..1443 (ix) FEATURE: (A)NAME/KEY: mat_peptide (B) LOCATION: 1..1443 (xi) SEQUENCE DESCRIPTION:SEQ ID NO: 269: ACC ATC ACC ACC GGA GCT TCT ATC ACA TAC TCC ACT TAC GGCAAG TTC 48 Thr Ile Thr Thr Gly Ala Ser Ile Thr Tyr Ser Thr Tyr Gly LysPhe 1 5 10 15 CTT GCT GAT GGA GGG TGT TCA GGC GGC GCG TAT GAC GTG ATCATA TGC 96 Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp Val Ile IleCys 20 25 30 GAC GAG TGC CAT TCC CAG GAC GCC ACC ACC ATT CTT GGG ATA GGCACT 144 Asp Glu Cys His Ser Gln Asp Ala Thr Thr Ile Leu Gly Ile Gly Thr35 40 45 GTC CTT GAC CAG GCA GAG ACG GCT GGA GCT AGG CTC GTC GTC TTG GCC192 Val Leu Asp Gln Ala Glu Thr Ala Gly Ala Arg Leu Val Val Leu Ala 5055 60 ACG GCC ACC CCT CCC GGC AGT GTG ACA ACG CCC CAC CCC AAC ATC GAG240 Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asn Ile Glu 6570 75 80 GAA GTG GCC CTG CCT CAG GAG GGG GAG GTT CCC TTC TAC GGC AGA GCC288 Glu Val Ala Leu Pro Gln Glu Gly Glu Val Pro Phe Tyr Gly Arg Ala 8590 95 ATT CCC CTT GCT TTT ATA AAG GGT GGT AGG CAT CTC ATC TTC TGC CAT336 Ile Pro Leu Ala Phe Ile Lys Gly Gly Arg His Leu Ile Phe Cys His 100105 110 TCC AAG AAA AAA TGT GAT GAA CTC GCC AAG CAA CTG ACC AGC CTG GGC384 Ser Lys Lys Lys Cys Asp Glu Leu Ala Lys Gln Leu Thr Ser Leu Gly 115120 125 GTG AAC GCC GTG GCA TAT TAT AGA GGT CTA GAC GTC GCC GTC ATC CCC432 Val Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ala Val Ile Pro 130135 140 ACA GCA GGA GAC GTG GTC GTG TGC AGC ACC GAC GCG CTC ATG ACG GGA480 Thr Ala Gly Asp Val Val Val Cys Ser Thr Asp Ala Leu Met Thr Gly 145150 155 160 TTC ACC GGC GAC TTT GAT TCT GTC ATA GAC TGC AAC TCC GCC GTCACT 528 Phe Thr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Ser Ala Val Thr165 170 175 CAG ACG GTG GAC TTC AGT CTG GAT CCC ACT TTT ACC ATT GAG ACTACC 576 Gln Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Thr180 185 190 ACA GTG CCC CAG GAC GCA GTG TCC AGA AGC CAG CGT AGG GGC CGCACG 624 Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Arg Gly Arg Thr195 200 205 GGG AGA GGT AGG CAC GGC ATA TAC CGG TAT GTC TCG GCT GGA GAGAGA 672 Gly Arg Gly Arg His Gly Ile Tyr Arg Tyr Val Ser Ala Gly Glu Arg210 215 220 CCG TCT GAC ATG TTC GAC TCC GTG GTG CTC TGT GAG TGC TAC GATGCC 720 Pro Ser Asp Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr Asp Ala225 230 235 240 GGA TGT GCG TGG TAT GAT CTG ACT CCT GCC GAG ACT ACC GTGAGG TTG 768 Gly Cys Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr Thr Val ArgLeu 245 250 255 CGC GCT TAC ATA AAC ACC CCC GGG CTC CCT GTC TGT CAG GACCAT TTG 816 Arg Ala Tyr Ile Asn Thr Pro Gly Leu Pro Val Cys Gln Asp HisLeu 260 265 270 GAA TTC TGG GAG GGG GTG TTC ACG GGG CTC ACT AAC ATC GACGCT CAC 864 Glu Phe Trp Glu Gly Val Phe Thr Gly Leu Thr Asn Ile Asp AlaHis 275 280 285 ATG CTG TCA CAG ACC AAA CAG GGT GGG GAG AAT TTC CCA TACCTT GTA 912 Met Leu Ser Gln Thr Lys Gln Gly Gly Glu Asn Phe Pro Tyr LeuVal 290 295 300 GCG TAC CAA GCA ACA GTC TGT GTT CGC GCG AAA GCG CCC CCCCCC AGC 960 Ala Tyr Gln Ala Thr Val Cys Val Arg Ala Lys Ala Pro Pro ProSer 305 310 315 320 TGG GAC ACA ATG TGG AAA TGC ATG CTC CGT CTC AAA CCGACT TTA ACT 1008 Trp Asp Thr Met Trp Lys Cys Met Leu Arg Leu Lys Pro ThrLeu Thr 325 330 335 GGC CCT ACT CCC CTC TTG TAC AGG CTG GGG CCC GTC CAGAAT GAG ATC 1056 Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Pro Val Gln AsnGlu Ile 340 345 350 ACA CTG ACG CAC CCC ATC ACC AAG TAC ATT ATG GCT TGCATG TCT GCG 1104 Thr Leu Thr His Pro Ile Thr Lys Tyr Ile Met Ala Cys MetSer Ala 355 360 365 GAC TTG GAG GTC ATT ACC AGC ACT TGG GTT CTG GTG GGGGGC GTT GTG 1152 Asp Leu Glu Val Ile Thr Ser Thr Trp Val Leu Val Gly GlyVal Val 370 375 380 GCG GCC CTG GCG GCC TAC TGC TTG ACG GTG GGT TCG GTAGCC ATA GTC 1200 Ala Ala Leu Ala Ala Tyr Cys Leu Thr Val Gly Ser Val AlaIle Val 385 390 395 400 GGT AGG ATC ATC CTC TCT GGG AAA CCT GCC ATC ATTCCC GAT AGG GAG 1248 Gly Arg Ile Ile Leu Ser Gly Lys Pro Ala Ile Ile ProAsp Arg Glu 405 410 415 GCA TTA TAC CAG CAA TTT GAT GAG ATG GAG GAG TGCTCG GCC TCG TTG 1296 Ala Leu Tyr Gln Gln Phe Asp Glu Met Glu Glu Cys SerAla Ser Leu 420 425 430 CCC TAT ATG GAC GAG ACA CGT GCC ATT GCC GGA CAATTC AAA GAG AAA 1344 Pro Tyr Met Asp Glu Thr Arg Ala Ile Ala Gly Gln PheLys Glu Lys 435 440 445 GTG CTC GGC TTC ATC AGC ACG ACC GGC CAG AAG GCTGAA ACT CTG AAG 1392 Val Leu Gly Phe Ile Ser Thr Thr Gly Gln Lys Ala GluThr Leu Lys 450 455 460 CCG GCA GCC ACG TCT GTG TGG AAC AAG GCT GAG CAGTTC TGG GCC ACA 1440 Pro Ala Ala Thr Ser Val Trp Asn Lys Ala Glu Gln PheTrp Ala Thr 465 470 475 480 TAC 1443 Tyr (2) INFORMATION FOR SEQ ID NO:270: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 481 amino acids (B) TYPE:amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (xi)SEQUENCE DESCRIPTION: SEQ ID NO: 270: Thr Ile Thr Thr Gly Ala Ser IleThr Tyr Ser Thr Tyr Gly Lys Phe 1 5 10 15 Leu Ala Asp Gly Gly Cys SerGly Gly Ala Tyr Asp Val Ile Ile Cys 20 25 30 Asp Glu Cys His Ser Gln AspAla Thr Thr Ile Leu Gly Ile Gly Thr 35 40 45 Val Leu Asp Gln Ala Glu ThrAla Gly Ala Arg Leu Val Val Leu Ala 50 55 60 Thr Ala Thr Pro Pro Gly SerVal Thr Thr Pro His Pro Asn Ile Glu 65 70 75 80 Glu Val Ala Leu Pro GlnGlu Gly Glu Val Pro Phe Tyr Gly Arg Ala 85 90 95 Ile Pro Leu Ala Phe IleLys Gly Gly Arg His Leu Ile Phe Cys His 100 105 110 Ser Lys Lys Lys CysAsp Glu Leu Ala Lys Gln Leu Thr Ser Leu Gly 115 120 125 Val Asn Ala ValAla Tyr Tyr Arg Gly Leu Asp Val Ala Val Ile Pro 130 135 140 Thr Ala GlyAsp Val Val Val Cys Ser Thr Asp Ala Leu Met Thr Gly 145 150 155 160 PheThr Gly Asp Phe Asp Ser Val Ile Asp Cys Asn Ser Ala Val Thr 165 170 175Gln Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Thr 180 185190 Thr Val Pro Gln Asp Ala Val Ser Arg Ser Gln Arg Arg Gly Arg Thr 195200 205 Gly Arg Gly Arg His Gly Ile Tyr Arg Tyr Val Ser Ala Gly Glu Arg210 215 220 Pro Ser Asp Met Phe Asp Ser Val Val Leu Cys Glu Cys Tyr AspAla 225 230 235 240 Gly Cys Ala Trp Tyr Asp Leu Thr Pro Ala Glu Thr ThrVal Arg Leu 245 250 255 Arg Ala Tyr Ile Asn Thr Pro Gly Leu Pro Val CysGln Asp His Leu 260 265 270 Glu Phe Trp Glu Gly Val Phe Thr Gly Leu ThrAsn Ile Asp Ala His 275 280 285 Met Leu Ser Gln Thr Lys Gln Gly Gly GluAsn Phe Pro Tyr Leu Val 290 295 300 Ala Tyr Gln Ala Thr Val Cys Val ArgAla Lys Ala Pro Pro Pro Ser 305 310 315 320 Trp Asp Thr Met Trp Lys CysMet Leu Arg Leu Lys Pro Thr Leu Thr 325 330 335 Gly Pro Thr Pro Leu LeuTyr Arg Leu Gly Pro Val Gln Asn Glu Ile 340 345 350 Thr Leu Thr His ProIle Thr Lys Tyr Ile Met Ala Cys Met Ser Ala 355 360 365 Asp Leu Glu ValIle Thr Ser Thr Trp Val Leu Val Gly Gly Val Val 370 375 380 Ala Ala LeuAla Ala Tyr Cys Leu Thr Val Gly Ser Val Ala Ile Val 385 390 395 400 GlyArg Ile Ile Leu Ser Gly Lys Pro Ala Ile Ile Pro Asp Arg Glu 405 410 415Ala Leu Tyr Gln Gln Phe Asp Glu Met Glu Glu Cys Ser Ala Ser Leu 420 425430 Pro Tyr Met Asp Glu Thr Arg Ala Ile Ala Gly Gln Phe Lys Glu Lys 435440 445 Val Leu Gly Phe Ile Ser Thr Thr Gly Gln Lys Ala Glu Thr Leu Lys450 455 460 Pro Ala Ala Thr Ser Val Trp Asn Lys Ala Glu Gln Phe Trp AlaThr 465 470 475 480 Tyr

1. A composition comprising or consisting of at least one polynucleicacid containing 8 or more contiguous nucleotides selected from at leastone of the following HCV sequences: an HCV type 3 genomic sequence, moreparticularly in any of the following regions: the region spanningpositions 417 to 957 of the Core/E1 region of HCV subtype 3 a, theregion spanning positions 4664 to 4730 of the NS3 region of HCV type 3,the region spanning positions 4892 to 5292 of the NS3/4 region of HCVtype 3, the region spanning positions 8023 to 8235 of the NS5 region ofHCV subtype 3 a, an HCV subtype 3 c genomic sequence, an HCV subtype 2 dgenomic sequence, an HCV type 4 genomic sequence, the coding region ofHCV subtype 5 a, with said nucleotide numbering being with respect tothe numbering of HCV nucleic acids as shown in Table 1, and with saidpolynucleic acids containing at least one nucleotide difference withknown HCV polynucleic acid sequences in the above-indicated regions, orthe complement thereof.
 2. A composition according to claim 1, whereinsaid polynucleic acids correspond to a nucleotide sequence selected fromany of the following HCV genomic sequences: an HCV genomic sequence ashaving a homology of at least 67%, preferably more than 69%, mostpreferably 71% or more to any of the sequences as represented in SEQ IDNO 13, 15, 17, 19, 21, 23, 25 or 27 in the region spanning positions 417to 957 of the Core/E1 region; an HCV genomic sequence as having ahomology of at least 65%, preferably more than 67%, most preferably 69%or more to any of the sequences as represented in SEQ ID NO 19, 21, 23,25 or 27 in the region spanning positions 574 to 957 of the E1 region;an HCV genomic sequence, having a homology of at least 79%, morepreferably at least 81%, most preferably more than 83% or more to any ofthe sequences as represented in SEQ ID NO 147 in the region spanningpositions 1 to 378 of the Core region, an HCV genomic sequence having ahomology of at least 74%, more preferably at least 76%, most preferablymore than 78% or more to any of the sequences as represented in SEQ IDNO 13, 15, 17, 19, 21, 23, 25 or 27 in the region spanning positions 417to 957 in the Core/E1 region; an HCV genomic sequence having a homologyof at least 74%, preferably more than 76%, most preferably 78% or moreto any of the sequences as represented in SEQ ID NO 13, 15, 17, 19, 21,23, 25 or 27 in the region spanning positions 574 to 957 in the E1region, an HCV genomic sequence having a homology of more than 73.5%,preferably more than 74%, most preferably 75% homology to any of thesequence as represented in SEQ ID NO 29 in the region spanning positions4664 to 4730 of the NS3 region; an HCV genomic sequence having ahomology of more than 70%, preferably more than 72%, most preferablymore than 74% homology to any of the sequences as represented in SEQ IDNO 29, 31, 33, 35, 37 or 39 in the region spanning positions 4892 to5292 in the NS3/NS4 region; an HCV genomic sequence having a homology ofmore than 95%, preferably 95,5%, most preferably 96% homology to any ofthe sequences as represented in SEQ ID NO 5, 7, 1, 3, 9 or 11 in theregion spanning positions 8023 to 8235 of the NS5 region; an HCV genomicsequence of the BR36 subgroup of HCV type 3 a having a homology of morethan 96%, preferably 96.5%, most preferably 97% homology to any of thesequences as represented in SEQ ID NO 5, 7, 1, 3, 9 or 11 in the regionspanning positions 8023 to 8192 of the NS5B region; an HCV genomicsequence having a homology of more than 79%, more preferably more than81%, and most preferably more than 83% to the sequence as represented inSEQ ID NO 149 in the region spanning positions 7932 to 8271 in the NS5Bregion.
 3. A composition according to claim 1, wherein said polynucleicacids correspond to a nucleotide sequence selected from any of thefollowing HCV genomic sequences: an HCV genomic sequence having ahomology of more than 85%, preferably more than 86%, most preferablymore than 87% homology to any of the sequences as represented in SEQ IDNO 41, 43, 45, 47, 49, 51, 53 or 151 in the region spanning positions 1to 573 of the Core region; an HCV genomic sequence having a homology ofmore than 61%, preferably more than 63%, most preferably more than 65%homology to any of the sequences as represented in SEQ ID NO 41, 43, 45,47, 49, 51, 53, 153 or 155 in the region spanning positions 574 to 957of the E1 region; an HCV genomic sequence having a homology of more than76.5%, preferably of more than 77%, most preferably of more than 78%homology with any of the sequences as represented in SEQ ID NO 55, 57,197 or 199 in the region spanning positions 3856 to 4209 of the NS3region; an HCV genomic sequence having a homology of more than 68%,preferably of more than 70%, most preferably of more than 72% homologywith the sequence as represented in SEQ ID NO 157 in the region spanningpositions 980 to 1179 of the E1/E2 region; an HCV genomic sequencehaving a homology of more than 57%, preferably more than 59%, mostpreferably more than 61% homology to any of the sequences as representedin SEQ ID NO 59 or 61 in the region spanning positions 4936 to 5296 ofthe NS4 region; an HCV genomic sequence having a homology of more than93%, preferably more than 93.5%, most preferably more than 94% homologyto any of the sequences as represented in SEQ ID NO 159 or 161 in theregion spanning positions 7932 to 8271 of the NS5B region.
 4. Acomposition according to claim 1, wherein said polynucleic acidscorrespond to a nucleotide sequence selected from any of the followingHCV genomic sequences: an HCV genomic sequence having a homology of morethan 66%, preferably more than 68%, most preferably more than 70%homology in the E1 region spanning positions 574 to 957 to any of thesequences as represented in SEQ ID NO 118, 120 or 122 in the regionspanning positions 1 to 957 of the Core/E1 region; an HCV genomicsequence having a homology of more than 71%, preferably more than 72%,most preferably more than 74% homology to any of the sequences asrepresented in SEQ ID NO 118, 120 or 122 in the region spanningpositions 379 to 957; an HCV genomic sequence having a homology of morethan 85%, preferably more than 86%, most preferably more than 86.5%homology to any of the sequences as represented in SEQ ID NO 183, 185 or187 in the region spanning positions 379 to 957 of the E1 region; an HCVgenomic sequence having a homology of more than 81%, preferably morethan 83%, most preferably more than 85% homology to the sequence asrepresented in SEQ ID NO 189 in the region spanning positions 379 to 957of the E1 region; an HCV genomic sequence having a homology of more than85%, preferably more than 87%, most preferably more than 89% homology toany of the sequences as represented in SEQ ID NO 167 or 169 in theregion spanning positions 379 to 957 of the E1 region; an HCV genomicsequence having a homology of more than 79%, preferably more than 81%,most preferably more than 83% homology to any of the sequences asrepresented in SEQ ID NO 171 or 173 in the region spanning positions 379to 957 of the E1 region; an HCV genomic sequence having a homology ofmore than 84%, preferably more than 86%, most preferably more than 88%homology to the sequence as represented in SEQ ID NO 175 in the regionspanning positions 379 to 957 of the E1 region; an HCV genomic sequencehaving a homology of more than 83%, preferably more than 85%, mostpreferably more than 87% homology to the sequence as represented in SEQID NO 177 in the region spanning positions 379 to 957 of the E1 region,an HCV genomic sequence having a homology of more than 76%, preferablymore than 78%, most preferably more than 80% homology to the sequence asrepresented in SEQ ID NO 179 in the region spanning positions 379 to 957of the E1 region; an HCV genomic sequence having a homology of more than84%, preferably more than 86%, most preferably more than 88% homology tothe sequence as represented in SEQ ID NO 181 in the region spanningpositions 379 to 957 of the E1 region; an HCV genomic sequence having ahomology of more than 73%, preferably more than 75%, most preferablymore than 77% homology to any of the sequences as represented in SEQ IDNO 106, 108, 110, 112, 114, or 116 in the region spanning positions 7932to 8271 of the NS5 region; an HCV genomic sequence having a homology ofmore than 88%, preferably more than 89%, most preferably more than 90%homology to any of the sequences as represented in SEQ ID NO 106, 108,110, or 112 in the region spanning positions 7932 to 8271 of the NS5region; an HCV genomic sequence having a homology of more than 88%,preferably more than 89%, most preferably more than 90% homology to anyof the sequences as represented in SEQ ID NO 116 or 201 in the regionspanning positions 7932 to 8271 of the NS5 region; an HCV genomicsequence having a homology of more than 87%, preferably more than 89%,most preferably more than 90% homology to the sequence as represented inSEQ ID NO 203 in the region spanning positions 7932 to 8271 of the NS5region; an HCV genomic sequence having a homology of more than 85%,preferably more than 87%, most preferably more than 89% homology to thesequence as represented in SEQ ID NO 114 in the region spanningpositions 7932 to 8271 of the NS5 region; an HCV genomic sequence havinga homology of more than 86%, preferably more than 87%, most preferablymore than 88% homology to the sequence as represented in SEQ ID NO 207in the region spanning positions 7932 to 8271 of the NS5 region; an HCVgenomic sequence having a homology of more than 84%, preferably morethan 86%, most preferably more than 88% homology to the sequence asrepresented in SEQ ID NO 209 in the region spanning positions 7932 to8271 of the NS5 region, an HCV genomic sequence having a homology ofmore than 81%, preferably more than 83%, most preferably more than 85%homology to the sequence as represented in SEQ ID NO 211 in the regionspanning positions 7932 to 8271 of the NS5 region
 5. A compositionaccording to claim 1, wherein said polynucleic acids correspond to anucleotide sequence selected from any of the following HCV genomicsequences an HCV genomic sequence having a homology of more than 78%,preferably more than 80%, most preferably more than 82% homology to thesequence as represented in SEQ ID NO 143 in the region spanningpositions 379 to 957 of the Core/E1 region, an HCV genomic sequencehaving a homology of more than 74%, preferably more than 76%, mostpreferably more than 78% homology to the sequence as represented in SEQID NO 143 in the region spanning positions 574 to 957; an HCV genomicsequence having a homology of more than 87%, preferably more than 89%,most preferably more than 91% homology to the sequence as represented inSEQ ID NO 145 in the region spanning positions 7932 to 8271 of the NS5Bregion
 6. A composition according to any of claims 1 to 5, wherein saidpolynucleic acid is liable to act as a primer for amplifying the nucleicacid of a certain isolate belonging to the genotype from which theprimer is derived.
 7. A composition according to any of claims 1 to 5,wherein said polynucleic acid is able to act as a hybridization probefor specific detection and/or classification into types of a nucleicacid containing said nucleotide sequence, with said oligonucleotidebeing possibly labelled or attached to a solid substrate.
 8. Use of acomposition according to any of claims 1 to 7 for in vitro detecting thepresence of one or more HCV genotypes, more particularly for detectingthe presence of a nucleic acid of any of the HCV genotypes having anucleotide sequence as defined in any of claims 1 to 5, present in abiological sample liable to contain them, comprising at least thefollowing steps: (i) possibly extracting sample nucleic acid, (ii)possibly amplifying the nucleic acid with at least one of the primersaccording to claim 6 or any other HCV type 2, HCV type 3, HCV type 4,HCV type 5 or universal HCV primer, (iii) hybridizing the nucleic acidsof the biological sample, possibly under denatured conditions, and withsaid nucleic acids being possibly labelled during or afteramplification, at appropriate conditions with one or more probesaccording to claim 7, with said probes being preferably attached to asolid substrate, (iv) washing at appropriate conditions, (v) detectingthe hybrids formed, (vi) inferring the presence of one or more HCVgenotypes present from the observed hybridization pattern.
 9. Acomposition consisting of or comprising at least one peptide orpolypeptide containing in its sequence a contiguous sequence of at least5 amino acids of an HCV polyprotein encoded by any of the polynucleicacids according to any of claims 1 to
 5. 10. A composition according toclaim 9, wherein said contiguous sequence contains in its sequence atleast one of the following amino acid residues: L7, Q43, M44, S60, R67,Q70, T71, A79, A87, N106, K115, A127, A190, S130, V134, G142, I144,E152, A157, V158, P165, S177 or Y177, I178, V180 or E180 or F182, R184,I186, H187, T189, A190, S191 or G191, Q192 or L192 or I192 or V192 orE192, N193 or H193 or P193, W194 or Y194, H195, A197 or I197 or V197 orT197, V202, 1203 or L203, Q208, A210, V212, F214, T216, R217 or D217 orE217 or V217, H218 or N218, H219 or V219 or L219, L227 or L227, M231 orE231 or Q23 1, T232 or D232 or A232 or K232, Q235 or I235, A237 or T237,I242, I246, S247, S248, V249, S250 or Y250, I251 or V251 or M251 orF251, D252, T254 or V254, L255 or V255, E256 or A256, M258 or F258 orV258, A260 or Q260 or S260, A261, T264 or Y264, M265, I266 or A266,A267, G268 or T268, F271 or M271 or V271, I277, M280 or H280, I284 orA284 or L84, V274, V291, N292 or S292, R293 or I293 or Y293, Q294 orR294, L297 or I297 or Q297, A299 or K299 or Q299, N303 or T303, T308 orL308, T310 or F310 or A310 or D310 or V310, L313, G317 or Q317, L333,S351, A358, A359, A363, S364, A366, T369, L373, F376, Q386, I387, S392,I399, F402, 1403, R405, D454, A461, A463, T464, K484, Q500, E501, S521,K522, H524, N528, S531, S532, V534, F536, F537, M539, I546, C1282,A1283, H1310, V1312, Q1321, P1368, V1372, V1373, K1405, Q1406, S1409,A1424, A1429, C1435, S1436, S1456, H1496, A1504, D1510, D1529, I1543,N1567, D1556, N1567, M1572, Q1579, L1581, S1583, F1585, V1595, E1606 orT1606, M1611, V1612 or L1612, P1630, C1636, P1651, T1656 or I1656,L1663, V1667, V1677, A1681, H1685, E1687, G1689, V1695, A1700, Q1704,Y1705, A1713, A1714 or S1714, M1718, D1719, A1721 or T1721, R1722, A1723or V1723, H1726 or G1726, E1730, V1732, F1735, I1736, S1737, R1738,T1739, G1740, Q1741, K1742, Q1743, A1744, T1745, L1746, E1747 or K1747,I1749, A1750, T1751 or A1751, V1753, N1755, K1756, A1757, P1758, A1759,H1762, T1763, Y1764, P2645, A2647, K2650, K2653 or L2653, S2664, N2673,F2680, K2681, L2686, H2692, Q2695 or L2695 or 12695, V2712, F2715, V2719or Q2719, T2722, T2724, S2725, R2726, G2729, Y2735, H2739, I2748, G2746or I2746, I2748, P2752 or K2752, P2754 or T2754, T2757 or P2757, withsaid notation being composed of a letter representing the amino acidresidue by its one-letter code, and a number representing the amino acidnumbering according to Kato et al., 1990 as shown in Table
 1. 11. Acomposition according to any of claims 9 or 10, wherein said contiguoussequence is selected from any of the following HCV amino acid sequences:a sequence having a homology of more than 72%, preferably more than 74%,and most preferably more than 77% homology to any of the amino acidsequences as represented in SEQ ID NO 14, 16, 18, 20, 22, 24, 26 or 28in the region spanning positions 140 to 319 in the Core/E1 region; asequence having a homology of more than 70%, preferably more than 72%,and most preferably more than 75% homology to any of the amino acidsequences as represented in SEQ ID NO 14, 16, 18, 20, 22, 24, 26 or 28in the E1 region spanning positions 192 to 319; a sequence having ahomology of more than 86%, preferably more than 88%, and most preferablymore than 90% homology to the amino acid sequences as represented in SEQID NO 148 in the region spanning positions 1 to 110 in the Core region;a sequence having a homology of more than 76%, preferably more than 78%,most preferably more than 80% to any of the amino acid sequences asrepresented in SEQ ID NO 30, 32, 34, 36, 38 or 40 in the region spanningpositions 1646 to 1764 in the NS3/NS4 region; a sequence having ahomology of more than 81.5%, preferably more than 83%, and mostpreferably more than 86% homology to any of the amino acid sequences asrepresented in SEQ ID NO 14, 16, 18, 20, 22, 24, 26 or 28 in the E1region spanning positions 192 to 319; a sequence having a homology ofmore than 86%, preferably more than 88%, most preferably more than 90%to the amino acid sequence as represented in SEQ ID NO 150 in the regionspanning positions 2645 to 2757 in the NS5B region;
 12. A compositionaccording to any of claims 9 or 10, wherein said contiguous sequence isselected from any of the following HCV amino acid sequences: a sequencehaving a homology of more than 80%, preferably more than 82%, mostpreferably more than 84% homology to any of the amino acid sequences asrepresented in SEQ ID NO 118, 120, and 122 in the region spanningpositions 127 to 319, a sequence having a homology of more than 73%,preferably more than 75%, most preferably more than 78% homology in theE1 region spanning positions 192 to 319 to any of the amino acidsequences as represented in SEQ ID NO 118, 120, and 122, in the regionspanning positions 127 to 319, a sequence having more than 85%,preferably more than 86%, most preferably more than 87% homology to anyof the amino acid sequences as represented in SEQ ID NO 118, 120 or 122,in the region spanning positions 192 to
 319. 13. A composition accordingto any of claims 9 or 10, wherein said contiguous sequence is selectedfrom any of the following HCV amino acid sequences: a sequence havingmore than 93%, preferably more than 94%, most preferably more than 95%homology in the region spanning Core positions 1 to 191 to any of theamino acid sequences as represented in SEQ ID NO 42, 44, 46, 48, 50, 52,54, or 152; a sequence having more than 73%, preferably more than 74%,most preferably more than 76% homology in the region spanning E1positions 192 to 319 to any of the amino acid sequences as representedin SEQ ID NO 42, 44, 46, 48, 50, 52, 54, 154 or 156; a sequence spanningpositions 1286 to 1403 of the NS3 region, with said sequence beingcharacterized as having more than 90%, preferably more than 91%, mostpreferably more than 92% homology to any of the amino acid sequencesrepresented in SEQ ID NO 56 to 58; a sequence spanning positions 1646 to1764 of the NS3/4 region, with said sequence being characterized ashaving more than 66%, more particularly 68%, most particularly 70% ormore homology to any of the amino acid sequences as represented in SEQID NO 60 or
 14. A composition according to any of claims 9 to 10,wherein said contiguous sequence is selected from any of the followingHCV amino acid sequences: a sequence having a more than 83%, preferablymore than 85%, most preferably more than 87% homology in the regionspanning Core positions 1 to 319 to the amino acid sequence asrepresented in SEQ ID NO 144; a sequence having a more than 79%,preferably more than 81%, most preferably more than 84% homology in theregion spanning E1 positions 192 to 319 to the amino acid sequence asrepresented in SEQ D NO 144; a sequence having more than 95%, moreparticularly 96%, most particularly 97% or more homology to the aminoacid sequence as represented in SEQ ID NO 146, in the region spanningpositions 2645 to 2757 of the NS5B region.
 15. A composition accordingto any of claims 9 to 14, wherein said sequence is selected from thefollowing peptides: QPTGRSWGQ (SEQ ID NO 93) RSEGRTSWAQ (SEQ ID NO 220)RTEGRTSWAQ (SEQ ID NO 221) SRRQPIPRARRTEGRSWAQ (SEQ ID NO 268)LEWRNTSGLYVL (SEQ ID NO 83) VNYRNASGIYHI (SEQ ID NO 126) QHYRNISGIYHV(SEQ ID NO 127) EHYRNASGIYHI (SEQ ID NO 128) IHYRNASGIYHI (SEQ ID NO224) VPYRNASGIYHV (SEQ ID NO 84) VNYRNASGIYHI (SEQ ID NO 225)VNYRNASGVYHI (SEQ ID NO 226) VNYHNTSGIYHL (SEQ ID NO 227) QHYRNASGIYHV(SEQ ID NO 228) QHYRNVSGIYHV (SEQ ID NO 229) IHYRNASDGYYI (SEQ ID NO230) LQVKNTSSSYMV (SEQ ID NO 231) VYEADDVILHT (SEQ ID NO 85) VYETEHHILHL(SEQ ID NO 129) VYEADHHIMHL (SEQ ID NO 130) VYETDHHILHL (SEQ ID NO 131)VYEADNLILHA (SEQ ID NO 86) VWQLRAIVLHV (SEQ ID NO 232) VYEADYHILHL (SEQID NO 233) VYETDNHILHL (SEQ ID NO 234) VYETENHILHL (SEQ ID NO 235)VFETVHHILHL (SEQ ID NO 236) VFETEHHILHL (SEQ ID NO 237) VFETDHHIMHL (SEQID NO 238) VYETENHILHL (SEQ ID NO 239) VYEADALILHA (SEQ ID NO 240)VQDGNTSTCWTPV (SEQ ID NO 87) VQDGNTSACWTPV (SEQ ID NO 241) VRVGNQSRCWVAL(SEQ ID NO 132) VRTGNTSRCWVPL (SEQ ID NO 133) VRAGNVSRCWTPV (SEQ ID NO134) EEKGNISRCWIPV (SEQ ID NO 242) VKTGNQSRCWVAL (SEQ ID NO 243)VRTGNQSRCWVAL (SEQ ID NO 244) VKTGNQSRCWIAL (SEQ ID NO 245)VKTGNVSRCWIPL (SEQ ID NO 247) VKTGNVSRCWISL (SEQ ID NO 248)VRKDNVSRCWVQI (SEQ ID NO 249) VRYVGATTAS (SEQ ID NO 89) APYIGAPLES (SEQID NO 135) APYVGAPLES (SEQ ID NO 136) AVSMDAPLES (SEQ ID NO 137)APSLGAVTAP (SEQ ID NO 90) APSFGAVTAP (SEQ ID NO 250) VSQPGALTKG (SEQ IDNO 251) VKYVGATTAS (SEQ ID NO 252) APYIGAPVES (SEQ ID NO 253) AQHLNAPLES(SEQ ID NO 254) SPYVGAPLEP (SEQ ID NO 255) SPYAGAPLEP (SEQ ID NO 256)APYLGAPLEP (SEQ ID NO 257) APYLGAPLES (SEQ ID NO 258) APYVGAPLES (SEQ IDNO 259) VPYLGAPLTS (SEQ ID NO 260) APHLRAPLSS (SEQ ID NO 261) APYLGAPLTS(SEQ ID NO 262) RPRRHQTVQT (SEQ ID NO 91) QPRRHWTTQD (SEQ ID NO 138)RPRRHWTTQD (SEQ ID NO 139) RPRQHATVQN (SEQ ID NO 92) RPRQHATVQD (SEQ IDNO 263) SPQHHKFVQD (SEQ ID NO 264) RPRRLWTTQE (SEQ ID NO 265) PPRIHETTQD(SEQ ID NO 266) TISYANGSGPSDDK (SEQ ID NO 267)


16. Recombinant vector, particularly for cloning and/or expression, withsaid recombinant vector comprising a vector sequence, an appropriateprokaryotic, eukaryotic or viral promoter sequence followed by thenucleotide sequences as defined in claims 1 to 5, with said recombinantvector allowing the expression of any one of the HCV type 2 and/or HCVtype 3 and/or type 4 and/or type 5 derived polypeptides according to anyof claims 9 to 15 in a prokaryotic, or eukaryotic host, or in livingmammals when injected as naked DNA, and more particularly a recombinantvector allowing the expression of any of the following HCV type 2, HCVtype 3, type 4 or type 5 polypeptides spanning the following amino acidpositions: a polypeptide starting at position 1 and ending at anyposition in the region between positions 70 and 326, more particularly apolypeptide spanning positions 1 to 70, 1 to 85, positions 1 to 120,positions 1 to 150, positions 1 to 191, positions 1 to 200, forexpression of the Core protein, and positions 1 to 263, positions 1 to326, for expression of the Core and E1 protein; a polypeptide startingat any position in the region between positions 117 and 192, and endingat any position in the region between positions 263 and 326, moreparticularly from positions 119 to 326, for expression of E1, or formsthat have the putative membrane anchor deleted (positions 264 to 293plus or minus 8 amino acids); a polypeptide starting at any position inthe region between positions 1556 and 1688, and ending at any positionin the region between positions 1739 and 1764, for expression of the NS4regions, more particularly a polypeptide starting at position 1658 andending at position 1711 for expression of the NS4a antigen, and moreparticularly, a polypeptide starting at position 1712 and ending betweenpositions 1743 and 1972, for example 1712-1743, 1712-1764, 1712-1782,1712-1972, 1712 to 1782 and 1902 to 1972 for expression of the NS4bprotein or parts thereof.
 17. A composition according to any of claims 9to 15, wherein said polypeptide is a recombinant polypeptide expressedby means of an expression vector as defined in claim
 16. 18. Acomposition according to any of claims 9 to 15 or 16, for use in amethod for immunizing a mammal, preferably humans, against HCVcomprising administratering a sufficient amount of the compositionpossibly accompanied by pharmaceutically acceptable adjuvants, toproduce an immune response, more particularly a vaccine compositionincluding HCV type 3 polypeptides derived from the E1, Core, or NS4region and/or type 4 and/or type 5 and/or type 2 polypeptides. 19.Antibody raised upon immunization with a composition according to any ofclaims 9 to 15, 17 or 18, by means of a process according to claim 18,with said antibody being reactive with any of the polypeptides asdefined in any of claims 9 to 15, 17 or
 18. 20. Process for detecting invitro HCV present in biological sample liable to contain it, comprisingat least the following steps: (i) contacting the biological sample to beanalyzed for the presence of HCV antibodies with any of the compositionsaccording to claims 9 to 15, 17 or 18, preferentially in an immobilizedform under appropriate conditions which allow the formation of an immunecomplex, wherein said polypeptide is preferentially in the form of abiotinylated polypeptide and is covalently bound to a solid substrate bymeans of streptavidin or avidin complexes, (ii) removing unboundcomponents, (iii) incubating the immunecomplexes formed withheterologous antibodies, which specifically bind to the antibodiespresent in the sample to be analyzed, with said heterologous antibodieshaving conjugated to a detectable label under appropriate conditions,(iv) detecting the presence of said immunecomplexes visually or by meansof densitometry and inferring the HCV serotype(s) present from theobserved hybridization pattern.
 21. Use of a composition according toany of claims 9 to 15, 17 or 18, for incorporation into a serotypingassay for detecting one or more serological types of HCV present in abiological sample liable to contain it, more particularly for detectingE1 and NS4 antigens or antibodies of the different types to be detectedcombined in one assay format, comprising at least the following steps:(i) contacting the biological sample to be analyzed for the presence ofHCV antibodies or antigens of one or more serological types, with atleast one of the compositions according to claims 9 to 15, 17 or 18 inan immobilized form under appropriate conditions which allow theformation of an immunecomplex, (wherein said polypeptide ispreferentially in the form of a biotinylated polypeptide and iscovalently bound to a solid substrate by means of streptavidin or avidincomplexes), (ii) removing unbound components, (iii) incubating theimmunecomplexes formed with heterologous antibodies, which specificallybind to the antibodies present in the sample to be analyzed, with saidheterologous antibodies having conjugated to a detectable label underappropriate conditions, (iv) detecting the presence of saidimmunecomplexes visually or by means of densitometry and inferring theHCV serological types present from the observed binding pattern.
 22. Akit for determining the presence of HCV genotypes as defined in any ofclaims 1 to 5 present in a biological sample liable to contain them,comprising: possibly at least one primer composition containing anyprimer selected from those defined in claim 6 or any other HCV type 2and/or HCV type 3 and/or HCV type 4 and/or HCV type 5, or universal HCVprimers, at least one probe composition according to claim 7, preferablyin combination with other polypeptides or peptides from HCV type 1, type2 or other types of HCV, with said probes being preferentiallyimmobilized on a solid substrate, and more preferentially on one and thesame membrane strip, a buffer or components necessary for producing thebuffer enabling hybridization reaction between these probes and thepossibly amplified products to be carried out, a means for detecting thehybrids resulting from the preceding hybriziation, possibly alsoincluding an automated scanning and interpretation device for inferingthe HCV genotype(s) present in the sample from the observedhybridization pattern.
 23. A kit for determining the presence of HCVantibodies according to any of claims 9 to 15, 17 or 18 present in abiological sample liable to contain them, comprising: at least onepolypeptide composition according to any of claims 9 to 15, 17 or 18,with said polypeptides being preferentially immobilized on a solidsubstrate, and more preferentially on one and the same membrane strip, abuffer or components necessary for producing the buffer enabling bindingreaction between these polypeptides and the antibodies against HCVpresent in the biological sample, a means for detecting the immunecomplexes formed in the preceding binding reaction, possibly alsoincluding an automated scanning and interpretation device for inferingthe HCV genotype present in the sample from the observed bindingpattern.